Lead Site Reliability Engineer - Managed Patching

External

Swift · Manassas

Full-timeOn-siteToday

AnsibleCI/CDComplianceGitIncident ResponseLeadership

Cover Letter Connect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role

About the role

We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value - across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy. We're unique too. We were established to find a better way for the global financial community to move value - a reliable, safe and secure approach that the community can trust, completely. We're always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions. Experience and Qualifications 12+ years in SRE, DevOps, Infrastructure Automation, or Platform Engineering Proven experience building and operating automation at enterprise scale Strong track record of leading end-to-end technical initiatives Degree in Computer Science/Engineering or equivalent experience

Responsibilities

Own the Managed Patching Service End-to-End
Lead the design, build, and rollout of an automated patching service at enterprise scale
Own service lifecycle: reliability, scalability, performance, and compliance
Translate regulatory and enterprise requirements into engineering solutions with audit-ready outcomes
Drive the evolution of the service towards a platform-based model over time
Build Automation Architecture and Orchestrated Workflows
Design and implement patching workflows using Ansible Automation Platform
Integrate with CI/CD orchestration tools such as CloudBees or equivalent
Define robust automation patterns: idempotency, versioning, rollback, safe concurrency, and failure isolation
Extend automation using Python where needed for orchestration logic and integrations
Implement Git-based workflows for version control, testing, and release governance
Define Service Model and Platform Integration
Design subscription and onboarding model via ServiceNow
Define scheduling, maintenance windows, and deployment strategies
Build integration patterns across ServiceNow, automation platforms, and inventory systems
Contribute towards evolving the service into a self-service platform
Observability, Reporting and Compliance
Design telemetry for patch outcomes, compliance posture, and drift detection
Build reporting capabilities for operational and regulatory visibility
Ensure all activities are traceable, auditable, and evidence-ready
Reliability Engineering and Continuous Improvement
Define SLIs, SLOs, and operational standards for the service
Lead incident response, root cause analysis, and corrective actions
Drive continuous improvement to reduce toil and improve resiliency
6.Cross-Functional Technical Leadership
Collaborate with Infrastructure, Security, Architecture, and Platform teams
Align patching strategies with enterprise standards and dependencies
Drive adoption of the service across teams
Influence without authority across a distributed organization
Required Skills
Strong expertise in Ansible Automation Platform
Experience with CI/CD orchestration tools (CloudBees or equivalent)
Deep Linux/RHEL knowledge including patching and system internals
Experience managing large-scale infrastructure environments
Strong understanding of SRE principles and operational practices
Experience integrating with ServiceNow or similar platforms
Strong stakeholder communication and cross-team collaboration
Preferred Skills
Experience with platform engineering and self-service models
Familiarity with infrastructure as code practices
Observability and monitoring integration experience
Experience with reporting tools such as Power BI
Experience in regulated or financial environments
What Success Looks Like (6-12 Months)
Managed Patching Service is live and stable for initial scope
Automated patching pipeline operational with minimal manual intervention
Clear visibility of patch compliance and outcomes across environments
Service scaled to broader adoption with standardized onboarding and execution
Foundations established for evolving into a self-service, platform-driven model

Benefits

Dental insuranceVision insurancePerformance bonus

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at swift? Share your experience

Interested in this role?

Apply on the company's website.

Cover Letter Connect