Senior Site Reliability Engineer - Observability
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
We are looking for a Senior SRE to join our Platform Engineering team as the operations owner of our observability platforms. You'll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization visibility into everything they build and run. The current observability platform is primarily comprised of on-premises ELK (Elasticsearch, Logstash, Kibana) Stack and Grafana, with some exposure to New Relic and SolarWinds. This is a hybrid role: roughly half your time will be spent on steady-state operations and platform support, and the other half on engineering projects that meaningfully advance the platforms you support. It's a great fit for someone who is genuinely motivated by the pursuit of excellence - not just sustaining what works but relentlessly refining it. You take pride in the platforms you own, and that pride drives you to keep improving them, whether that means tightening an SLO, eliminating a source of toil, or building something that gives teams faster insight into their systems.