Staff DevOps Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
This team manages essential meeting service operations at Zoom. They handle global, large-scale distributed systems and advance communication technology to connect individuals across physical distances.
Requirements
- 10+ years in DevOps, SRE, or infrastructure engineering roles, with at least 3 years at a staff or principal level scope.
- Have a proven track record owning reliability for large-scale, distributed, latency-sensitive systems in production
- Have experience in supporting real-time or media-heavy platforms (video conferencing, live streaming, gaming, trading systems, or similar).
- Demonstrate ability to lead cross-functional technical initiatives without direct authority, driving alignment across engineering, product, and operations.
- Have conceptual and architectural understanding of real-time communication protocols: WebRTC, RTP/RTCP, TURN/STUN, SDP, and SFU/MCU topologies.
- Have solid expertise in cloud infrastructure (AWS, GCP, or Azure) and container orchestration (Kubernetes, Helm, ArgoCD).
- Demonstrate proficiency with infrastructure-as-code tooling: Terraform, Pulumi, or equivalent.
- Have experience with observability stacks: Prometheus, Grafana, Datadog, Jaeger, OpenTelemetry, or equivalent.
- Have an understanding of networking fundamentals: BGP, anycast routing, DNS, load balancing, and CDN architecture.
- Utilize CI/CD tools such as GitHub Actions, Jenkins, and Spinnaker to streamline workflows and improve deployment processes.
- Implement deployment safety practices like canary releases, feature flags, and blue/green strategies to ensure reliable software delivery.
- Demonstrate proficiency in Python, Bash, or Go for automation, tooling, and incident response without requiring advanced software developm
Benefits
Additional Information
Immigration sponsorship is not available for this position What you can expect We are hiring a Staff DevOps/Site Reliability Engineer to ensure reliability, scalability, and operational excellence for our real-time communications platform. This platform supports audio/video conferencing, recording, and live-streaming functionalities. The position requires expertise in infrastructure engineering, global team collaboration, and cross-functional partnerships.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Zoom? Share your experience