Principal, Cloud Engineer - Observability
About the role
Do you want to work on cutting-edge cloud technologies that power the next generation of enterprise-scale platforms? We are seeking a highly motivated Principal Cloud Engineer to join our Observability Platform team within Fidelity Architecture and Engineering. In this role, you will help design, build, and operate scalable, cloud-native observability solutions that support our most critical digital services. You will work in a collaborative, transparent, and innovation-driven environment where engineering excellence, continuous learning, and open-source contribution are core to how we operate. This is a high-impact role where your expertise will influence platform architecture, engineering practices, and the developer experience across the organization.
Responsibilities
- Lead the design and implementation of cloud-native observability platforms across AWS and Azure environments
- Develop and operate highly scalable systems on AWS (EKS, core services) with a strong focus on reliability, performance, and automation
- Own the end-to-end lifecycle of observability tooling, including hosting, maintenance, scaling, and optimization
- Drive adoption of OpenTelemetry (OTel) standards for metrics, traces, logs, and profiling
- Build and enhance platform capabilities using Python and/or Go
- Architect and optimize CI/CD pipelines enabling rapid, secure, and reliable deployments
- Collaborate with cross-functional teams to improve system visibility, debugging capabilities, and performance insights
- Define and promote best practices for cloud engineering, observability, and platform reliability
- Mentor engineers and provide technical leadership across squads
The Experience We're Looking For
- 10+ years of software engineering or cloud engineering experience
- Deep expertise in the AWS cloud stack, especially EKS (Kubernetes on AWS) and core services (IAM, EC2, networking, storage, etc.)
- Strong experience working with Kubernetes and cloud-native ecosystems
- Hands-on experience with observability tools/platforms (e.g., Prometheus, Grafana, Datadog, OpenTelemetry), including hosting and operational ownership
- Proficiency in Python and/or Go for platform and tooling development
- Strong understanding of CI/CD practices and tools
- Experience working with Infrastructure as Code (Terraform, CloudFormation, etc.)
- Familiarity with the Azure cloud stack and hybrid/multi-cloud environments
- Working knowledge of OpenTelemetry (OTel) concepts and implementation
Bonus Skills
- Experience building or operating large-scale internal platforms
- Exposure to eBPF-based observability or advanced profiling solutions
- Experience integrating observability across multi-region / multi-cloud environments
- Experience or interest in applying AI/ML techniques to observability (e.g., anomaly detection, predictive insights, intelligent alerting, or AIOps)
- Active participation in or contributions to open-source projects
- Strong background in performance optimization and distributed systems
Why Join Us
- Work on mission-critical, high-scale platforms
- Be part of a forward-looking, cloud-first organization
- Contribute to open technologies and modern observability practices
- Thrive in a culture that values learning, ownership, and engineering excellence
Additional Information
Note: Fidelity will not provide immigration sponsorship for this position.