Senior Site Reliability Engineer (Fully Remote)
About the role
At Partnerize, we're on a mission to transform the way businesses grow. We've built the leading partnership automation platform that empowers brands to discover, engage, and convert their audiences at scale. From affiliate marketing to influencer collaborations, we help our clients build and manage profitable partnerships that drive real results. We're a team of passionate problem-solvers who are dedicated to helping our clients win in the ever-evolving world of digital marketing.

Why Join Us
We're looking for passionate, talented people who want to be part of a winning team. At Partnerize, you'll find a culture of collaboration, innovation, and respect. We're guided by our core values, and we're committed to creating an environment where everyone can do their best work. We also offer a competitive salary, generous benefits, and a flexible work environment that allows you to thrive both personally and professionally. If you're ready to grow your career and make a difference, we'd love to hear from you.

Senior Site Reliability Engineer

Job Summary
The Senior Site Reliability Engineer is a key technical leadership role at the intersection of infrastructure, platform engineering, operational resilience, and developer enablement. At Partnerize, we operate a high-scale platform processing over a billion daily events across a hybrid estate spanning on-prem datacentres and AWS cloud infrastructure. You will play a critical role in ensuring our systems remain reliable, scalable, secure, and operationally efficient whilst helping evolve our engineering culture towards a modern "you build it, you own it" model.

This is not a purely operational support role. We are looking for an experienced technical leader and subject matter expert who can balance hands-on engineering with strategic thinking - someone capable of modernising legacy systems, improving observability and automation, and guiding engineers through increasingly complex distributed environments.

The Things You Care About
At the heart of our platform, we process and distribute performance marketing data at enormous scale, generating over a billion events across our infrastructure daily. Our systems power real-time decision making, partner attribution, analytics, and customer-facing products that require high availability and operational excellence.

As we continue to scale globally, we are evolving our infrastructure, modernising legacy systems, and investing heavily in automation, containerisation, observability, and platform reliability. We deploy to production multiple times a day and operate a hybrid environment spanning physical datacentres and AWS cloud services.

Our infrastructure and engineering ecosystem includes technologies such as Linux, Kubernetes, Docker, Kafka, MySQL, PostgreSQL, Redis, Python, Terraform, Ansible, Elasticsearch, Druid, and modern CI/CD tooling.

As a Senior Site Reliability Engineer at Partnerize, You Will:
Lead Reliability & Operations: Take ownership of platform reliability, availability, and operational performance across a complex hybrid estate supporting Partnerize, BrandVerity, Ascend, and Konnecto.
Drive Platform Modernisation: Help lead the transition from legacy operational models towards modern containerised infrastructure and self-service engineering platforms.
Champion DevOps Enablement: Empower development teams through automation, tooling, and platform improvements that support a "you build it, you own it" engineering culture.
Act as a Technical Authority: Provide technical leadership and guidance across infrastructure, distributed systems, security, observability, and operational best practices.
Improve Security & Resilience: Partner closely with security and compliance teams to strengthen platform security posture, conduct threat modelling, vulnerability assessments, and support secure-by-design engineering practices.
Mentor & Develop Engineers: Act as a player-coach within the SRE organisation, mentoring engineers, supporting technical growth, and helping shape future technical leaders.
Lead Incident Management: Take ownership during major incidents, driving clear communication, structured troubleshooting, and pragmatic resolution strategies across the wider business.
Drive Automation & Continuous Improvement: Continuously improve infrastructure, deployment pipelines, monitoring, and operational tooling to improve reliability, scalability, and delivery velocity.

You are an experienced Site Reliability Engineer with:
Deep Technical Expertise: Strong experience operating and supporting distributed systems at scale across Linux, cloud, and hybrid infrastructure environments.
Platform Engineering Experience: Hands-on expertise with Kubernetes, Docker, Terraform, Ansible, CI/CD pipelines, and infrastructure automation.
Strong Systems Thinking: The ability to troubleshoot complex systems methodically, balancing operational stability with long-term architectural improvements.
Securit
Benefits