Sr. Software Development Engineer - Orchestration Platform, Temporal, Fleet Management (Flexibility on level)

Zscaler · San Jose, CA
Full-time · On-site · 1w ago
Agile · Ansible · AWS · CI/CD · Data Modeling · Docker

Benefits

Health insurance

Additional Information

About Zscaler

Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, we are constantly pushing the envelope, leveraging the world's largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location.

Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate; we're focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership, and accountability. We value high impact and high accountability with a sense of urgency, where you're enabled to do your best work and embrace your potential.

If you're driven by purpose, thrive on solving complex challenges, and want to be part of the team that's helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity.

Role

We are looking for a Software Engineer (Reliability) to join our team in San Jose, CA, reporting to the Vice President of Engineering. This is a hybrid role, three days a week onsite, within the Service Platform Automation department. You will build and operate the orchestration and reliability automation that manages ZIA's fleet lifecycle at massive scale.

This is a high-ownership role: you will design and implement orchestration workflows and the supporting services needed for safe, deterministic, idempotent fleet operations, while helping the team evolve toward AI-first execution and operations.

What you'll do (Role Expectations)

Replace legacy Python/Ansible with a centralized, deterministic orchestration platform, refactoring automation into modular, well-defined workflows while retiring external dependencies and nested logic
Engineer execution patterns with retries, idempotency, rate limits/backpressure, and safe rollbacks/compensation designs aligned to global fleet capacity
Implement safe rollouts using segmentation, canaries, and automated health checks to limit blast radius during fleet-wide upgrades and remediation
Add strong observability and auditability (metrics, traces, replayable histories), participate in on-call rotation, and drive software-based fixes to reduce toil following post-incident reviews
Integrate AI/LLM capabilities to accelerate legacy code migration and enhance safe operational outcomes through intelligent triage, correlation, and automated runbook generation

Who You Are (Success Profile)

You thrive in ambiguity. You're comfortable building the path as you walk it. You thrive in a dynamic environment, seeing ambiguity not as a hindrance, but as the raw material to build something meaningful.
You act like an owner. Your passion for the mission fuels your bias for action. You operate with integrity because you genuinely care about the outcome. True ownership involves leveraging dynamic range: the ability to navigate seamlessly between high-level strategy and hands-on execution.
You are a problem-solver. You love running towards the challenges because you are laser-focused on finding the solution, knowing that solving the hard problems delivers the biggest impact.
You are a high-trust collaborator. You are ambitious for the team, not just yourself. You embrace our challenge culture by giving and receiving ongoing feedback, knowing that candor delivered with clarity and respect is the truest form of teamwork and the fastest way to earn trust.
You are a learner. You have a true growth mindset and are obsessed with your own development, actively seeking feedback to become a better partner and a stronger teammate. You love what you do and you do it with purpose.

What We're Looking for (Minimum Qualifications)

BS/MS in Computer Science or a related technical field with 5+ years of experience building and operating production-grade software systems
Strong proficiency in backend/systems languages (Go, Java, C++, or Rust) with the ability to write high-quality, maintainable code
Deep experience designing and operating distributed systems, including concurrency, failure handling, performance optimization, and data modeling
Proven track record of building automation using REST APIs and Swagger with strong guarantees for idempotency, verification, and safe rollout patterns
Hands-on experience with cloud platforms (AWS/GCP, GKE, Cloud SQL etc.) and proficiency in containerization and CI/CD workflows using Docker and GitLab

What Will Make You Stand Out (Preferred
