Manager, SRE
Responsibilities
- Lead and Mentor: Manage, mentor, and grow a high-performing SRE team, fostering a culture of ownership, learning, and collaboration.
- System Reliability: Own the availability, scalability, and performance of business-critical systems across production environments.
- Incident Management: Define and improve incident response processes, ensuring quick mitigation and clear postmortems to drive learning and prevention.
- Operational Excellence: Develop and track SLOs, SLIs, and SLAs, ensuring engineering teams meet reliability targets.
- Automation & Tooling: Drive initiatives to automate manual operations, improve observability, and reduce toil.
- Collaboration: Work closely with software engineering, platform, and security teams to build reliable, secure, and scalable systems.
- Capacity & Cost Planning: Anticipate growth and ensure systems scale efficiently while keeping infrastructure costs under control.
- Continuous Improvement: Champion reliability best practices, chaos testing, and capacity planning to evolve systems proactively.
- On-Call Health: Maintain a healthy on-call rotation that balances coverage with team well-being.
Requirements
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 8+ years of experience in Site Reliability Engineering, DevOps, or Production Engineering, including at least 2 years in a people management or tech lead role.
- Strong understanding of distributed systems, cloud infrastructure (AWS preferred), container orchestration (Kubernetes), and networking fundamentals.
- Expertise in observability platforms (Datadog, Prometheus, OpenTelemetry) and incident management best practices.
- Proficiency with automation/scripting (Python, Go, or similar) and infrastructure-as-code (Terraform, Pulumi).
- Experience defining and driving SLO/SLI-based reliability strategies.
- Excellent problem-solving skills and ability to lead teams through complex production incidents.
- Exceptional communication skills, with the ability to collaborate with engineering and business stakeholders.
- Passion for building resilient systems and enabling teams to deliver software quickly and safely.
Additional Information
Our Mission: 6sense's mission is to multiply what matters: growth, retention, and efficiency. We envision a future where companies, teams and people reach their full potential.
Our People: People are the heart and soul of 6sense. We serve with passion and purpose. We live by our Being 6sense values of Win as One Team, Stay Curious, Do The Right Thing, Own the Outcome, and Create Belonging. Every 6sensor plays a part in defining the future of our industry-leading technology. 6sense is a place where difference-makers roll up their sleeves, take risks, act with integrity, and measure success by the value we create for our customers. We want 6sense to be the best chapter of your career.
As an SRE Manager, you will be responsible for leading a team that ensures the scalability, reliability, and performance of 6sense's core infrastructure and customer-facing services. This role combines people leadership, technical depth, and operational excellence. You will set the vision for reliability engineering, own the availability of critical systems, and foster a culture of proactive problem-solving and continuous improvement. You will partner with engineering, product, and security teams to design systems that are resilient by default, minimize downtime, and scale to meet our rapidly growing customer base. This is a high-impact leadership role with visibility across the organization.