Lead Site Reliability Engineer - Managed Patching
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value - across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy. We're unique too. We were established to find a better way for the global financial community to move value - a reliable, safe and secure approach that the community can trust, completely. We're always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions. Experience and Qualifications 12+ years in SRE, DevOps, Infrastructure Automation, or Platform Engineering Proven experience building and operating automation at enterprise scale Strong track record of leading end-to-end technical initiatives Degree in Computer Science/Engineering or equivalent experience
Responsibilities
- Own the Managed Patching Service End-to-End
- Lead the design, build, and rollout of an automated patching service at enterprise scale
- Own service lifecycle: reliability, scalability, performance, and compliance
- Translate regulatory and enterprise requirements into engineering solutions with audit-ready outcomes
- Drive the evolution of the service towards a platform-based model over time
- Build Automation Architecture and Orchestrated Workflows
- Design and implement patching workflows using Ansible Automation Platform
- Integrate with CI/CD orchestration tools such as CloudBees or equivalent
- Define robust automation patterns: idempotency, versioning, rollback, safe concurrency, and failure isolation
- Extend automation using Python where needed for orchestration logic and integrations
- Implement Git-based workflows for version control, testing, and release governance
- Define Service Model and Platform Integration
- Design subscription and onboarding model via ServiceNow
- Define scheduling, maintenance windows, and deployment strategies
- Build integration patterns across ServiceNow, automation platforms, and inventory systems
- Contribute towards evolving the service into a self-service platform
- Observability, Reporting and Compliance
- Design telemetry for patch outcomes, compliance posture, and drift detection
- Build reporting capabilities for operational and regulatory visibility
- Ensure all activities are traceable, auditable, and evidence-ready
- Reliability Engineering and Continuous Improvement
- Define SLIs, SLOs, and operational standards for the service
- Lead incident response, root cause analysis, and corrective actions
- Drive continuous improvement to reduce toil and improve resiliency
- 6.Cross-Functional Technical Leadership
- Collaborate with Infrastructure, Security, Architecture, and Platform teams
- Align patching strategies with enterprise standards and dependencies
- Drive adoption of the service across teams
- Influence without authority across a distributed organization
- Required Skills
- Strong expertise in Ansible Automation Platform
- Experience with CI/CD orchestration tools (CloudBees or equivalent)
- Deep Linux/RHEL knowledge including patching and system internals
- Experience managing large-scale infrastructure environments
- Strong understanding of SRE principles and operational practices
- Experience integrating with ServiceNow or similar platforms
- Strong stakeholder communication and cross-team collaboration
- Preferred Skills
- Experience with platform engineering and self-service models
- Familiarity with infrastructure as code practices
- Observability and monitoring integration experience
- Experience with reporting tools such as Power BI
- Experience in regulated or financial environments
- What Success Looks Like (6-12 Months)
- Managed Patching Service is live and stable for initial scope
- Automated patching pipeline operational with minimal manual intervention
- Clear visibility of patch compliance and outcomes across environments
- Service scaled to broader adoption with standardized onboarding and execution
- Foundations established for evolving into a self-service, platform-driven model
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at swift? Share your experience