Staff SRE SW Development Engineer

External

Dexcom · Bengaluru, India

Full-timeOn-site6d ago

BashCI/CDComplianceGCPIncident ResponseKubernetes

Cover Letter Connect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role

Benefits

Health insuranceVision insurance

Additional Information

The Company Dexcom Corporation (NASDAQ DXCM) is a pioneer and global leader in continuous glucose monitoring (CGM). Dexcom began as a small company with a big dream: To forever change how diabetes is managed. To unlock information and insights that drive better health outcomes. Here we are 25 years later, having pioneered an industry. And we're just getting started. We are broadening our vision beyond diabetes to empower people to take control of health. That means personalized, actionable insights aimed at solving important health challenges. To continue what we've started: Improving human health. We are driven by thousands of ambitious, passionate people worldwide who are willing to fight like warriors to earn the trust of our customers by listening, serving with integrity, thinking big, and being dependable. We've already changed millions of lives and we're ready to change millions more. Our future ambition is to become a leading consumer health technology company while continuing to develop solutions for serious health conditions. We'll get there by constantly reinventing unique biosensing-technology experiences. Though we've come a long way from our small company days, our dreams are bigger than ever. The opportunity to improve health on a global scale stands before us. Meet the Team Dexcom is seeking a motivated and experienced Senior Site Reliability Engineer to architect, build, and operate the resilient, scalable, and secure cloud infrastructure that powers our R&D Platform serving millions of Customers every day. This role is crucial in ensuring rapid, safe, and compliant delivery of life-changing medical technologies. As a senior technical leader within the SRE team, you will provide strategic guidance and technical oversight, partnering with engineering, platform, and architecture groups to drive organizational reliability maturity. You will be responsible for leading initiatives in automation, observability, and incident management while fostering a culture of operational excellence and continuous improvement. This position offers a unique opportunity to shape the future of Dexcom's evolving cloud reliability strategy in a fast-paced, collaborative environment. You will play a pivotal role in driving systematic root cause analysis and mentoring other engineers to adopt a reliability-first design mindset across our global cloud-native ecosystem. Where you come in: Architect and evolve Dexcom's observability ecosystem, defining standards for metrics, logging, tracing, and SLO/SLA-driven reliability. Design, build, and operate highly available cloud infrastructure on Google Cloud Platform (GCP), focusing on performance, scalability, and security. Lead Kubernetes platform operations, improving cluster reliability, multi-tenant architecture, and deployment patterns. Diagnose and resolve complex failures across cloud infrastructure, CI/CD pipelines, policy engines, and microservices. Set the direction for Infrastructure as Code (IaC), defining best practices with Terraform, Pulumi, or Crossplane for automated provisioning. Drive automation strategy to eliminate toil, build self-service capabilities, and operationalize guardrails for compliance and cost efficiency. Lead major incident response and conduct deep post-incident reviews to implement remediations that prevent recurring failure categories. Mentor engineers and influence cross-functional practices to help teams adopt operational discipline and cloud-native best practices. Partner with developer teams to optimize capacity strategies and ensure the seamless delivery of high-quality solutions. What makes you successful: Problem-Solver & Innovator: Proven ability to solve complex failures across distributed systems, navigating technical debt to drive long-term systematic fixes. Technical Maestro: Expert-level knowledge of GCP and deep Kubernetes operational mastery, setting the stage for resilient cloud-native architectures. Visionary Leader: Your portfolio highlights a series of well-orchestrated reliability initiatives, reflecting your flair for turning technical strategy into operational success. Observability Strategist: Extensive experience designing metrics pipelines and SLO frameworks that provide actionable insights and reduce MTTR. Automation Advocate: Advanced proficiency in Python, Go, or Bash, with a track record of building maintainable tooling that eliminates manual toil. Great Communicator: Exceptional skills in articulating complex technical concepts to both engineering peers and stakeholders, ensuring alignment on reliability goals. Analytical Architect: Apply a software engineering mindset to build, test, and maintain infrastructure code that ensures scalable and repeatable platform provisioning. Hands-on experience with modern declarative ecosystems like Pulumi, Crossplane, or similar tools is highly preferred. Collaborative Mentor: Strong ability to influence architectural decisions across teams while motivating and growi

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at dexcom? Share your experience

Interested in this role?

Apply on the company's website.

Cover Letter Connect