
Site Reliability Engineering Lead, Specialist

Vanguard · Malvern, PA
Full-time · Hybrid · 2w ago
AWS · Incident Response · Java · JavaScript · Microservices · Observability

Requirements

  • Expertise in JavaScript (server-side and client-side execution environments) or Java.
  • Working knowledge of Python (or a similar scripting language).
  • Strong knowledge of resiliency engineering techniques for both platforms and applications.
  • Experience troubleshooting complex production issues and implementing effective mitigations.
  • Hands-on experience with AWS services and cloud infrastructure.
  • Familiarity with OpenTelemetry specification and core APIs.
  • Practical experience developing and operating software in distributed systems environments.

Special Factors

Sponsorship

Vanguard is not offering visa sponsorship for this position.

About Vanguard

At Vanguard, we don't just have a mission; we're on a mission.

To work for the long-term financial wellbeing of our clients. To lead through products and services that transform our clients' lives. To learn and develop our skills as individuals and as a team. From Malvern to Melbourne, our mission drives us forward and inspires us to be our best.

How We Work

Additional Information

At Vanguard, we pride ourselves on delivering an exceptional client experience to all investors; at the core of this experience are systems that reside in a technically complex and constantly evolving resiliency landscape. Passionate, technically skilled engineers are at the center of our resiliency operations, and we are looking to grow our team.

We are seeking an experienced engineer with broad, end-to-end software development experience, including operating applications in a microservices environment in production at scale. This role goes beyond feature implementation - it requires someone who can design, build, and support resilient systems from the ground up.

As a Senior Reliability Engineer at Vanguard, you will play a critical role in solving impactful operational problems. You are curious and take a proactive approach to identifying problems and making improvements. You balance innovative thinking with pragmatism and understand the long-term impacts of technical decisions. You communicate complex ideas clearly and collaborate effectively to deliver scalable solutions.

Core Responsibilities

  • Improve resiliency engineering practices across platforms and applications, including resilient application design patterns, system observability, and deployment strategies.
  • Incident detection, troubleshooting, and resolution.
  • Develop automation for incident response and infrastructure management.
  • Develop and support OpenTelemetry integrations for multiple application platforms (browser, ECS, Lambda, etc.) and languages (JavaScript, Java).
  • Contribute to architectural decisions and support implementation of solutions.


Interested in this role?

Apply on the company's website.
