Site Reliability Engineering Lead
About the role
As the Site Reliability Engineering (SRE) Lead, your mission is to ensure the reliability, security, scalability, and performance of Intigriti's cloud platform and supporting systems. You are responsible for designing, implementing, and continuously improving the infrastructure, tooling, and operational practices that enable our engineering teams to deliver reliable and secure products to our customers.

As a cybersecurity company, security is a core responsibility of the role. You will work closely with Engineering, Security, and Product teams to establish secure-by-default platform standards, maintain a strong security posture, and ensure our cloud environment remains resilient against evolving threats.

Intigriti embraces strong mentorship and professional development as part of its culture. As SRE Lead, you are responsible for supporting the growth of the SRE team through coaching, mentorship, regular one-on-one meetings, and technical leadership.

You proactively improve the reliability and security of our platform through automation, platform engineering, infrastructure modernization, and security initiatives. You also coordinate operational and security incident response activities, drive continuous improvement, and ensure lessons learned are translated into lasting improvements across the organization.

Technical Leadership
Provide technical leadership to the SRE team and act as a trusted advisor to engineering teams across the business.
Define and maintain platform engineering, cloud infrastructure, reliability, and security standards.
Drive architectural decisions that improve scalability, reliability, maintainability, and security.
Foster a culture of ownership, continuous improvement, collaboration, and operational excellence.
Collaborate with Engineering, Security, Product, and Leadership teams to align platform initiatives with business objectives.

Platform Engineering & Automation
Drive the development of automation, self-service tooling, and infrastructure-as-code practices.
Reduce operational overhead through automation and process improvement.
Build and maintain reusable platform capabilities that enable engineering teams to operate efficiently and securely.
Lead cloud platform modernization initiatives and continuously improve platform reliability and developer experience.
Ensure infrastructure solutions remain cost-effective and operationally efficient.

Reliability & Operational Excellence
Lead the response and coordination of production incidents and major service disruptions.
Drive root cause analysis and ensure identified improvements are implemented.
Establish and maintain monitoring, alerting, observability, and operational health practices across the platform.
Develop and maintain operational runbooks, recovery procedures, and platform documentation.
Lead disaster recovery, backup, resilience testing, and business continuity initiatives.
Ensure critical systems can be restored and operated effectively during failure scenarios.

Platform Security
Design and maintain secure-by-default cloud and platform architectures.
Establish and enforce security best practices across cloud infrastructure, Kubernetes environments, networking, and identity management.
Partner closely with the Security team to strengthen Intigriti's overall security posture.
Drive infrastructure hardening, vulnerability remediation, secrets management, access control, and security automation initiatives.
Support the implementation and operation of security tooling, including endpoint security, cloud security controls, logging, and detection capabilities.
Participate in security architecture reviews and risk assessments for new platform initiatives.
Help establish foundational cloud security practices and security guardrails across the organization.

Security Readiness & Governance
Support security incident response activities and coordinate platform remediation efforts.
Contribute to tabletop exercises, disaster recovery exercises, and security readiness initiatives.
Collaborate with Security, Compliance, and Engineering teams to support ISO27001, SOC2, customer security reviews, and audit activities.
Help maintain evidence, documentation, and technical controls required to meet security and compliance obligations.
Partner with internal stakeholders to continuously improve Intigriti's security posture and operational maturity.

Mentorship & People Development
Mentor and coach SRE team members, supporting their professional growth and development.
Conduct regular one-on-one meetings and career development discussions.
Support hiring, onboarding, and performance review activities.
Encourage knowledge sharing and continuous learning across the team.
Stay informed on emerging technologies, reliability practices, and security trends, sharing relevant insights with the organization.

Leadership Skills
Proven experience leading and mentoring a team of SREs or operations professionals.
Ability to inspire and motivate team members to