Staff Software Engineer, Datalake Platform
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
About Stripe Stripe is a financial infrastructure platform for businesses. Millions of companies - from the world's largest enterprises to the most ambitious startups - use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone's reach while doing the most important work of your career. The Datalake team builds and maintains Stripe's foundational data access and governance infrastructure - the paved path for safe, fast, and compliant access to Stripe's critical big data assets. We serve developers, data engineers, analysts, ML and AI teams, security teams, and business users across the company. The team is in the middle of a significant architectural transition as Stripe grows. We are making Stripe's data lake a first-class citizen of the modern data ecosystem to support our growing scale and diverse workloads. What Makes This Role Compelling Foundational infrastructure with broad reach: The Datalake team's systems sit in the critical path of nearly every data workload at Stripe. Decisions affect petabytes of data, hundreds of production pipelines, and every engineering team that builds on Stripe's data lake. Active, high-stakes architectural transformation: The team is executing a multi-year migration to modern, OSS-aligned solutions - a technically deep project with real architectural choices at each step, including API design, compute engine integration, authorization model, and per-table credential vending. Active, high-stakes, OSS-aligned architectural transformation: You will lead a multi-year migration to modern, open-source solutions like the Apache Iceberg REST Catalog. This is a technically deep project involving critical architectural choices at each step, from API design and compute engine integration to authorization models, where your opinions and technical influence will directly shape how the platform engages with the broader data infrastructure ecosystem. Storage platform ownership with room to define the approach: The team owns the object storage abstraction layer - access control, IAM policy design, lifecycle management, and compliance architecture - but the how is still being written. You'll shape how hundreds of engineering teams interact with petabytes of data, and the decisions you make will stick. At Stripe you'll have the scale of the large company and the agency to influence technical strategy and the roadmap
Responsibilities
- Lead compliance architecture: Partner with security and compliance teams to translate regulatory requirements into durable preventative technical controls - audit logging, access review infrastructure, data segregation, and lifecycle enforcement - built into the platform rather than bolted on.
- Drive cost and efficiency at petabyte scale: Identify systemic inefficiencies in storage layout, snapshot retention, and data lifecycle, and design automated, self-service tooling that scales without ongoing manual intervention from the team.
Requirements
- Minimum requirements
- 10+ years of professional software engineering experience.
- Demonstrated track record of designing, building, and operating large-scale distributed storage or data infrastructure systems.
- Deep experience with object storage (S3, Azure Blob, or equivalent) - including IAM, access control policy design, lifecycle management, and operational practices at petabyte
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Stripe? Share your experience