Skip to main content
Back to jobs

Senior Manager, Site Reliability Engineering

External
Clover Health logoClover Health · Remote
Full-timeRemoteToday
ArgoCDCI/CDGCPGitHubGitHub ActionsGrafana
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Benefits

Health insuranceDental insuranceVision insurance401(k)Remote work optionsFlexible scheduleEquity / stock optionsPerformance bonusParental leave

Additional Information

At Counterpart Health, we are transforming healthcare and improving patient care with our innovative primary care tool, Counterpart Assistant. By supporting Primary Care Physicians (PCPs), we deliver improved outcomes at lower cost through early diagnosis and longitudinal care management of chronic conditions. We're looking for a Senior Manager of Site Reliability Engineering to join our team. You'll lead a team of ~10 SREs across North America, UK, HK, and New Zealand - owning both the day-to-day operations and the long-term technical direction of the SRE organization. This role sits at the intersection of people leadership, technical depth, and strategic partnership: you're here to make Counterpart's infrastructure reliable, scalable, and cost-efficient - and to transform the SRE team's engagement model from reactive support to proactive collaboration with our product engineering pillars. As a Senior Manager, Site Reliability Engineering, you will: Lead and grow our SRE team of ~10 engineers, including hiring, retention, career development, and performance management across multiple time zones (US, HK, NZ). Build strategic partnerships with product engineering pillars - shifting SRE from reactive, ticket-based support to proactive co-ownership of reliability outcomes. Scale our multi-tenant infrastructure to support new customer onboarding and growing patient populations. Own cloud cost management and FinOps practices, building frameworks that balance cost control with reliability and performance. Champion developer self-service and platform engineering. Build self-service capabilities so product teams can manage routine operations without filing SRE tickets. Establish SLOs/SLIs for critical services and improve alert quality so every page is meaningful. Ensure the SRE team is fully leveraging AI tooling in their workflows - using tools like Claude Code for IaC generation, log analysis, root cause investigation, and automating repetitive work - at the same level as the rest of engineering. You should get in touch if: You have 6+ years managing an SRE team and 10+ years of hands-on SRE or infrastructure engineering experience. You're deeply comfortable with our core stack: Kubernetes, GCP (GKE, Cloud SQL, Pub/Sub, GCS), Terraform, Helm, ArgoCD, PostgreSQL, and Prometheus/Grafana. You have strong programming skills in Python and/or Go, and you're comfortable writing and reviewing infrastructure tooling code - including using AI coding tools to do so. You have experience with CI/CD pipelines (GitHub Actions) and a track record of building or improving developer tooling and automation. You have sound build vs. buy judgment - you default to the right answer, not the easiest one, and you're comfortable building internal tooling when existing solutions don't fit. You have experience leading teams across multiple time zones and a track record of developing engineers into strong technical contributors. Benefits Overview : Financial Well-Being : Our commitment to attracting and retaining top talent begins with a competitive base salary and equity opportunities. Additionally, we offer a performance-based bonus program, 401k matching, and regular compensation reviews to recognize and reward exceptional contributions. Physical Well-Being : We prioritize the health and well-being of our employees and their families by providing comprehensive medical, dental, and vision coverage. Your health matters to us, and we invest in ensuring you have access to quality healthcare. Mental Well-Being : We understand the importance of mental health in fostering productivity and maintaining work-life balance. To support this, we offer initiatives such as No-Meeting Fridays, monthly company holidays, access to mental health resources, and a generous flexible time-off policy. Additionally, we embrace a remote-first culture that supports collaboration and flexibility, allowing our team members to thrive from any location. Professional Development : Developing internal talent is a priority for Clover. We offer learning programs, mentorship, professional development funding, and regular performance feedback and reviews. Additional Perks: Employee Stock Purchase Plan (ESPP) offering discounted equity opportunities Reimbursement for office setup expenses Monthly cell phone & internet stipend Remote-first culture, enabling collaboration with global teams Paid parental leave for all new parents And much more! About Counterpart Health: In 2018, Clover Health set out to do something unprecedented: build a clinically intuitive, AI-enabled solution that fits within physicians' workflows to help support the earlier diagnosis and management of chronic conditions. Years later, that vision is a reality, with thousands of practitioners using Counterpart Assistant during patient visits to improve disease management, reduce medical expenses, and drive success in value-based care. With an exceptional team of value-based care


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Clover Health? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect