Lead SRE - BeReal

Voodoo · Paris, France
Full-time · Remote · 2d ago
ArgoCD · AWS · CI/CD · Datadog · GCP · GitHub

Benefits

- Competitive salary based on experience
- Swile lunch voucher
- Gymlib (100% covered by Voodoo)
- Premium healthcare coverage with SideCare, 100% covered for you and your family
- Wellness activities in our Paris office
- Health insurance

Additional Information

About BeReal

At BeReal, we are dedicated to authenticity in social media. By encouraging users to share unfiltered moments, we foster genuine connections and celebrate real life. We are now an international team of 100+ and have 40M+ monthly active users. Backed by Voodoo, our team is fully focused on scaling BeReal into an iconic social network used by hundreds of millions.

The Infrastructure team provides the backbone that powers the company's growth, ensuring the scalability, efficiency, and reliability of our platform. We design and operate our infrastructure on GCP. Working hand in hand with developers, we enable teams to ship fast and efficiently while maintaining a strong focus on costs and performance. Our mission is to create a developer-friendly, cost-effective, and highly automated infrastructure that supports innovation at scale.

Role

- Define and drive SRE practices across the organization, including SLIs, SLOs, error budgets, incident management, postmortem processes, and long-term reliability improvements across the platform
- Design, implement, and optimize infrastructure for availability, scalability, reliability, and cost efficiency
- Own and evolve our observability stack, improving monitoring, alerting, logging, and distributed tracing
- Drive automation of infrastructure and operational workflows (e.g., Terraform, Terragrunt, Kubernetes)
- Lead FinOps initiatives, developing tools and insights to optimize cloud costs
- Partner closely with development squads to improve service reliability, performance, and operational excellence
- Influence architectural decisions and establish best practices for building resilient distributed systems
- Mentor and support Infrastructure engineers, helping raise the bar on reliability, operational excellence, and technical execution
- Analyze performance bottlenecks and work on solutions such as scaling strategies, service optimizations, and system debugging

Profile

- Strong knowledge of Kubernetes
- Experience with high traffic, distributed systems architectures, and related tools (service discovery, config/secret management, etc.)
- Strong knowledge of one cloud provider (AWS or GCP preferred)
- Proven experience defining and operating SRE practices (SLOs, incident management, observability, reliability engineering)
- Strong operational mindset with experience managing production incidents and driving reliability improvements
- Leadership and mentoring experience, with the ability to influence technical decisions across teams
- Ownership-driven - If something isn't working, you don't wait for instructions; you improve it
- Pragmatic and impact-oriented - You balance reliability, delivery speed, and business priorities
- Performance vs cost-conscious - You make decisions that align with both technical excellence and financial sustainability

Our Stack

- Operator: Kubernetes
- CI/CD: ArgoCD, GitHub Actions
- Cloud provider: GCP
- Monitoring: Datadog
- Infra as code: Terraform / Terragrunt
- Languages: Go / Node.js
- Datastores: Spanner / PostgreSQL / Redis


Interested in this role?

Apply on the company's website.