Skip to main content
Back to jobs

Production Engineer/Site Reliability Engineer (Shift Basis)

External
Rubrik logoRubrik · Bangalore, India
Full-timeOn-site2w ago
AuditingCloudFormationIncident ResponseKubernetesMachine LearningMySQL
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Production Engineer The Production Engineer at Rubrik plays a critical role in operational excellence, managing alerts, responding to outages, and leading incident resolution as an Incident Manager. This role requires hands-on experience in maintaining highly available critical services across multi-cloud environments while driving continuous improvements through automation and intelligent monitoring.

Responsibilities

  • Join a 24/7 Production Operations team responsible for managing and supporting critical infrastructure and services in multi-cloud environments.
  • Oversee staging and production environments to ensure maximum uptime and reliability.
  • Implement and maintain comprehensive observability solutions for real-time monitoring, alerting, and metrics collection.
  • Lead incident management efforts by swiftly responding to alerts and outages, coordinating teams to drive timely resolution.
  • Analyze recurring incidents to identify root causes, reduce toil, and improve system resilience.
  • Design and develop automation tools to proactively detect, triage, and remediate production issues.
  • Maintain and update runbooks to support incident response and recurring issues.
  • Demonstrate strong decision-making skills under pressure, effectively managing critical situations with urgency and composure.
  • Experience you'll need:
  • Solid understanding of distributed system concepts.
  • Practical experience working with production systems and environments, preferably within public cloud infrastructures.
  • Familiarity with container orchestration platforms, especially Kubernetes.
  • Hands-on experience with infrastructure management tools like CloudFormation and Terraform.
  • Strong analytical and problem-solving skills for diagnosing and resolving system and application issues.
  • Proficient in data structures and algorithms, UNIX, networking, operating systems, and database systems such as MySQL.
  • Proficient with Python programming skills.
  • Excellent verbal and written communication skills.
  • Location: Bangalore, India
  • Work Shift: Rotation (24/7 coverage expected)
  • ABOUT RUBRIK
  • Join Us in Securing the World's Data
  • Join Us in Securing and Accelerating the World's AI Transformation
  • Linkedin | X (formerly Twitter) | Instagram | Rubrik.com
  • Inclusion @ Rubrik
  • At Rubrik, we are dedicated to fostering a culture where people from all backgrounds are valued, feel they belong, and believe they can succeed. Our commitment to inclusion is at the heart of our mission to secure the world's data.
  • Our inclusion strategy focuses on three core areas of our business and culture:
  • Our Company: We are committed to building a merit-based organization that offers equal access to growth and success for all employees globally. Your potential is limitless here.
  • Our Culture: We strive to create an inclusive atmosphere where individuals from all backgrounds feel a strong sense of belonging, can thrive, and do their best work. Your contributions help us innovate and break bou

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Rubrik? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect