Skip to main content
Back to jobs

Senior SRE, Ads

External
Reddit logoReddit · Remote
Full-timeRemote1d ago
Capacity PlanningIncident ResponseLinuxObservabilityPythonSite Reliability Engineering
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Partner with Ads Engineering teams to improve reliability, scalability, and operational excellence of ad-serving, auction, targeting, measurement, and billing systems.
  • Design, build, and maintain infrastructure, tooling, and automation that improve service reliability and engineering productivity.
  • Improve observability through monitoring, alerting, tracing, logging, and dashboards.
  • Participate in on-call rotations and lead incident response efforts for critical production systems.
  • Run root cause analysis and drive corrective actions following incidents.
  • Collaborate with software engineers throughout the service lifecycle, from design reviews through production operations.
  • Drive adoption of SRE best practices including SLIs, SLOs, error budgets, capacity planning, and operational readiness reviews.
  • Reduce operational toil through automation and self-service tooling.
  • Help define and measure advertiser-critical user journeys such as campaign creation, ad delivery, reporting, and billing.
  • Scale Ads systems to support continued traffic growth, increased advertiser demand, and evolving business requirements.
  • Required Qualifications:
  • 5+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or related roles operating large scale distributed systems.
  • Strong experience supporting high traffic, user facing production environments.
  • Good understanding of distributed systems, networking, Linux systems, cloud native architectures.
  • Good programming skills in languages such as Go, Python, or similar.
  • Demonstrated ability to troubleshoot complex issues across applications, infrastructure, networking, and services.
  • Experience with observability platforms, monitoring systems, alerting, and incident response.
  • Experience driving automation and operational improvements.

Benefits

Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving supportFamily Planning SupportGender-Affirming CareMental Health & Coaching BenefitsPrivate Medical, Dental, and Vision BenefitsPersonal Retirement Savings Account with matching contributionCycle to Work and Tax Saver schemesFlexible Vacation & Paid Volunteer Time OffGenerous Paid Parental LeaveIn select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.Health insuranceDental insuranceVision insurancePaid time offRemote work optionsFlexible scheduleParental leave

Additional Information

Reddit is a community of communities. It's built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information. For more information, visit www.redditinc.com . Location: Reddit has a flexible first workforce. Don't live near our office? No worries: you can work remotely from anywhere in the UK, the Netherlands or Ireland. The Ads organization powers Reddit's advertising platform, enabling advertisers to reach highly engaged communities while helping Reddit grow its business. The reliability of our Ads systems directly impacts advertiser success, revenue generation, and user experience. The Ads Reliability team partners closely with Ads Engineering to improve reliability, scalability, operational excellence, and developer productivity across Reddit's advertising ecosystem. We help build and operate highly available services that drive revenue and maintain advertiser trust. We're looking for a Senior Site Reliability Engineer to build, operate, and scale the critical systems behind Reddit Ads.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Reddit? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect