AI Operations Engineering Technical Leader
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Cisco is looking for a highly experienced and innovative ML Operations Engineer to join
- Design, build, and manage robust ML pipelines for training, validation, and deployment
- Build and maintain scalable infrastructure using Kubeflow for ML experiments and inference in multiple public cloud s
- Implement CI/CD in GitHub for ML systems ensuring reproducibility and traceability
- Experience driving the implementation LLM evaluation and observability solutions
- Advocate automation in every layer of the infrastructure stack using Infrastructure as Code ( IaC ) principles and tools such as Terraform, Helm, and GitOps frameworks
- Monitor models in production for performance degradation, drift, and fairness
- Participate in on-call rotation for ML Operations
- Work closely with data scientists, engineers, and product managers to understand requirements and integrate models into applications
Requirements
- Bachelor's degree in Comp Science, Engineering (or related field /industry) + 8 years of DevOps experience, Masters + 6 years of related experience, or PhD + 3 years of related experience .
- Understanding of CI/CD pipelines and automation tools .
- Knowledge of cloud platforms , minimally AWS with Azure and GCP as a bonus
- Proficiency in Python and familiarity with ML libraries (e.g., Scikit-learn, PyTorch , TensorFlow , etc. )
- Strong understanding of ML lifecycle management and model versioning
- Experience deploying large language models (LLMs) or generative AI systems
- Familiarity with feature stores, vector databases, or data observability platforms
- Excellent communication, collaboration, and mentoring skills.
- Deep expertise in CI/CD tooling and practices, including hands-on experience with systems like Jenkins, GitLab, ArgoCD , or similar.
- Strong proficiency in Kubernetes, Docker, and cloud-native patterns in AWS, Azure, or GCP.
- Why Cisco?
- We are Cisco, and our power starts with you.
- Message to applicants applying to work in the U.S. and/or Canada:
- The starting salary range posted for this position is $212,300.00 to $275,800.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.
Benefits
Additional Information
The application window is expected to close on: 06/29/2026 Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received . Meet the Team The Cisco AI Software & Platform Group incubates and delivers Generative AI based solutions to reinvent Cisco's existing Products and how customers interact with them. Our Group is also introducing new offerings that help customers roll out Generative AI at scale while doing so responsibly. Ultimately, we are doing so through internal platforms that unlock the benefits of this technology for Cisco teams and partners across our Security, Enterprise Networking, Collaboration, and Splunk portfolios.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Cisco? Share your experience