Site Reliability Engineer
Responsibilities
- Design, deploy, and operate observability pipelines for logs, metrics, traces, and alerts across Proton's services using open-source technologies.
- Partner with development and platform teams to ship practical alerting, dashboarding, and integration solutions that engineers actually rely on.
- Build reusable templates and tooling that streamline onboarding, incident response, and analysis.
- Champion observability best practices across teams and raise the bar for how Proton instruments its systems.
- Build AI-powered tooling that sharpens detection, analysis, and response capabilities.
- Evolve the observability platform iteratively to meet the real needs of internal stakeholders.
Requirements
- Extensive experience in an SRE, DevOps, or Platform Engineering role.
- Comfortable writing Python and/or Go for tooling and automation.
- Hands-on experience operating open-source observability stacks (logs, metrics, traces, alerting).
- Working knowledge of Kubernetes and GitOps workflows.
- Practical experience with infrastructure-as-code (Terraform, Ansible, Puppet, or similar) and solid Linux system administration skills.
- Familiarity with OpenTelemetry.
- Experience running ClickHouse for log and metric storage at scale.
- Interest in or experience with AI/ML tooling.
Additional Information
Join Proton and build a better internet where privacy is the default

Proton was founded in 2014 by scientists from CERN on a simple truth: privacy is a fundamental human right. Since then, we've built the world's largest encrypted email service (Proton Mail) and expanded into Proton VPN, Proton Drive, Proton Pass, and Proton Calendar - tools used by millions globally to protect their freedom, fight censorship, and keep their data safe. In some situations, Proton has literally helped save lives.

We are profitable, independent (no VC control), and selectively hire from the top ~1% of applicants. Our 700+ team members across 50+ countries come from leading organizations and elite academic backgrounds. We move fast, keep hierarchy light, and prioritize impact over optics. If you want to do meaningful work with exceptionally high-caliber people, this is it. Check our open-source projects here.

Purpose of the role
We're a small, tool-agnostic team that owns the observability infrastructure behind Proton's services - the logs, metrics, traces, and alerts that keep systems running smoothly for the millions of users who trust us with their privacy. We run on open-source stacks across Proton's on-premise data centers, and we dogfood heavily: we're our own first customers. We favor simple, solid solutions over large engineering efforts, and we believe good systems emerge iteratively. You'll join a group that values frank, open communication and a problem-solving mentality - if you want narrow scope and a fixed backlog, this isn't the right fit.

Tech Stack and Tools
- Languages: Python, Go
- Observability: open-source stacks for logs, metrics, traces, alerting; OpenTelemetry
- Orchestration: Kubernetes
- GitOps: ArgoCD
- Infrastructure-as-code: Terraform, Ansible, Puppet
- Storage at scale: ClickHouse
- Platform: Linux, on-premise data centers