
Site Reliability Engineer IV

Premera · Mountlake Terrace, WA
Full-time · Hybrid · 2w ago
CI/CD · Classification · Compliance · Docker · Incident Response · Java

Responsibilities

  • Build, run, and optimize critical services across cloud, on-premise, and hybrid environments, including managed services, custom applications, and third-party integrations
  • Develop automation and AI-powered tooling to reduce manual intervention, including anomaly detection, predictive alerting, and LLM-assisted diagnostics that surface actionable insights
  • Design and implement end-to-end observability, telemetry, and self-healing capabilities across platforms
  • Lead cross-team efforts to drive root cause analysis, post-incident reviews, and long-term reliability improvements
  • Define and drive reliability strategy, standards, and best practices across engineering teams
  • Standardize workflows for change management, deployment, and incident response, replacing manual processes with tooling-driven solutions
  • Partner with engineering and security teams to ensure deployment pipelines and automation practices meet reliability, safety, and compliance standards
  • Influence adoption of modern DevOps practices including CI/CD, infrastructure-as-code, and test-driven development
  • Stay current on emerging technologies in AI/ML, DevOps, and platform engineering, and apply them to improve operational efficiency
  • Participate in the on-call rotation and support production systems as needed
  • Although this role does not involve day-to-day coding, it requires strong technical depth to guide teams, conduct rapid proofs of concept, and provide guidance on performance, reliability, cost, and operational excellence

Requirements

  • Bachelor's degree in Computer Science, Information Systems, or related field - or equivalent experience
  • 7+ years of experience in Site Reliability Engineering, DevOps, or IT Operations within complex environments
  • Demonstrated experience leveraging AI platforms and tooling to design and build automation solutions
  • Hands-on experience applying AI/ML to operational workflows, including anomaly detection, predictive alerting, or intelligent automation at scale
  • Advanced experience with Kubernetes, Docker, and container-based platforms
  • Deep expertise with event streaming platforms
  • Experience working across cloud, on-premise, and hybrid environments
  • Experience working in large-scale, regulated enterprise environments

Knowledge, Skills, and Abilities

  • Advanced troubleshooting across distributed systems and applications
  • Proficiency in one or more programming languages such as Python, Java, C#, JavaScript, or PowerShell
  • Familiarity with AI/ML concepts and integrating intelligent automation into operational workflows
  • Ability to debug complex systems and guide teams through technical problem-solving
  • Strong collaboration and communication skills across engineering teams

Premera total rewards

  • Medical, vision, and dental coverage with low employee premiums.
  • Voluntary benefit offerings, including pet insurance for paw parents.
  • Life and disability insurance.
  • Retirement programs, including a 401K employer match and, believe it or not, a pension plan that is veste

Benefits

  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k)

Additional Information

Workforce Classification: Hybrid

Join Our Team: Do Meaningful Work and Improve People's Lives

Our purpose, to improve customers' lives by making healthcare work better, is far from ordinary. And so are our employees. Working at Premera means you have the opportunity to drive real change by transforming healthcare.

Premera is committed to being a workplace where people feel empowered to grow, innovate, and lead with purpose. By investing in our employees and fostering a culture of collaboration and continuous development, we're able to better serve our customers. It's this commitment that has earned us recognition as one of the best companies to work for. Learn more about our recent awards and recognitions as a greatest workplace.

Learn how Premera supports our members, customers and the communities that we serve through our Healthsource blog: https://healthsource.premera.com/

Site Reliability Engineer IV

Job Description Summary

As a Site Reliability Engineer IV, you will drive reliability and operational excellence across cloud, on-premise, and hybrid platforms. You will build scalable automation and AI-powered tooling to improve system health, reduce manual effort, and accelerate incident response. Partnering with software and platform engineering teams, you will standardize CI/CD, observability, and incident management practices, enabling resilient, self-healing systems. This role is critical to scaling engineering reliability and advancing intelligent automation across enterprise platforms.

This is a hybrid role, located on our campus in Mountlake Terrace, Washington.

