Senior Cloud Operations Engineer - AI Cloud Ops
Responsibilities
- Support the operation and maintenance of cloud infrastructure across AWS environments
- Deploy, manage, and operate applications on Kubernetes clusters (EKS or similar)
- Perform Kubernetes troubleshooting including pod failures, scaling issues, networking, and cluster health
- Assist in managing production systems including monitoring, alerting, and incident response
- Troubleshoot and resolve infrastructure and application issues in a timely manner
- Work with engineering teams to support deployments and improve CI/CD workflows, including container-based releases
- Contribute to infrastructure automation using tools like Terraform and Ansible
- Help maintain system reliability, availability, and performance through day-to-day operations
- Participate in on-call rotation and incident response processes
- Assist with capacity planning and cost optimization efforts
- Partner with security teams to support compliance and security best practices
- Contribute to improving observability via logs, metrics, and dashboards
AI-Driven Operations
- Actively leverage AI/ML tools and copilots to improve cloud operations efficiency and effectiveness
- Use AI-assisted solutions for incident triage, root cause analysis, alert noise reduction, and automation of repetitive tasks
- Continuously identify opportunities to apply AI to reduce operational toil and improve MTTR
- Collaborate with teams to integrate AI-driven insights into monitoring, logging, and operational workflows
What You Bring to ServiceMax
- 3-5 years of experience in cloud operations, DevOps, or infrastructure engineering
- Ability to commute to the San Ramon office 2-3 times a week
- Basic to strong understanding of AWS cloud services
- Experience working in Linux/Unix environments
- Familiarity with container technologies (Docker and/or Kubernetes)
- Exposure to infrastructure-as-code tools (Terraform, Ansible, or similar)
- Understanding of CI/CD concepts and tools
- Familiarity with monitoring and logging tools (e.g., CloudWatch, ELK, Prometheus, Grafana)
- Strong problem-solving skills and willingness to learn
- Ability to work collaboratively in a team environment
- Good communication skills and attention to detail
Required AI Skills
- Hands-on experience using AI-powered tools such as GitHub Copilot, AWS Kiro, Claude Code, or Codex
- Understanding of AI applications in cloud operations (anomaly detection, automation, incident analysis)
- Ability to use AI tools for troubleshooting, log analysis, and workflow improvement
- Demonstrated curiosity and willingness to adopt AI-driven approaches
Nice to Have Qualifications
- AWS certifications (Associate level preferred)
- Exposure to scripting/programming (Python or Shell)
- Experience in SaaS or cloud-based production environments
- Exposure to AIOps or observability platforms with built-in AI capabilities
What ServiceMax Offers You
- Competitive health and wellness benefits (Medical, Dental, Vision, Life Insurance)
- Flexible Spending Accounts
- Flexible Time Off
- 401(k) with employer match
- Commuter Benefits
- Opportunities for learning, mentorship, and career growth
- Compensation (CIP)
Additional Information
Our world is transforming, and PTC is leading the way. Our software brings the physical and digital worlds together, enabling companies to improve operations, create better products, and empower people in all aspects of their business. Our people make all the difference in our success. Today, we are a global team of nearly 7,000 and our main objective is to create opportunities for our team members to explore, learn, and grow - all while seeing their ideas come to life and celebrating the differences that make us who we are and the work we do possible.

Sr TechOps Engineer, AIOps (Cloud)
Hybrid - San Ramon, CA

What We Do
ServiceMax is the global leader in Service Execution Management, delivering cloud-based software that helps companies maintain and service complex equipment at scale. Our platform powers mission-critical operations for customers around the world. We foster a collaborative, #wintogether and #customerobsessed culture, where engineers learn, grow, and contribute to building reliable, secure, and intelligent cloud systems.