Executive Director, Platform Governance & Strategy
ExternalFull-timeRemote1w ago
ComplianceIncident ResponseJiraKafkaKubernetesLeadership
Prepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- The ideal candidate brings deep technical credibility, executive communication skills, and a proven track record of operationalizing reliability, architectural governance, and compliance programs at scale in a regulated environment.
- Primary Duties and Responsibilities:
- To perform this job successfully, an individual must be able to perform each primary duty satisfactorily.
- Site Reliability Engineering
- Lead the scaling and maturation of the SRE practice, establishing error budgets, SLOs, SLAs, and incident response frameworks across all platform services.
- Define and enforce reliability standards including on-call models, blameless postmortem processes, and corrective action tracking to drive continuous improvement.
- Partner with Platform Foundation teams (Kubernetes, Kafka, FinOps/Security) to embed reliability principles into build and operate models.
- Champion toil reduction through automation, ensuring engineering capacity is redirected from manual operations to higher-value platform capabilities.
- Platform Engineering Governance & Compliance
- Serve as Product Manager for the FinOps and SecOps domains within Platform Engineering, owning the product vision, prioritization, and stakeholder alignment for governance tooling and practices.
- Establish and maintain a governance framework ensuring Platform Engineering adheres to organizational standards across incident and problem management, SORTs, risk tracking, and audit findings.
- Own the end-to-end process for PE compliance obligations, ensuring timely resolution and closure of incidents, problem tickets, risk items, and audit observations with clear accountability and tracking.
- Partner with Risk, Compliance, and Security functions to proactively identify governance gaps, drive remediation, and ensure PE operates within the organization's risk appetite.
- Maintain visibility and reporting on PE's compliance posture across all obligation types, surfacing trends, aging items, and residual risks to CARE leadership and relevant stakeholders.
- Site Reliability Engineering COE
- Lead the scaling and maturation of the SRE practice, establishing error budgets, SLOs, SLAs, and incident response frameworks across all platform services.
- Define and enforce reliability standards including on-call models, blameless postmortem processes, and corrective action tracking to drive continuous improvement.
- Partner with Platform Engineering Product teams (Kubernetes, Kafka, FinOps/Security) to embed reliability principles into build and operate models.
- Champion toil reduction through automation, ensuring engineering capacity is redirected from manual operations to higher-value platform capabilities.
- Cloud Strategy & Architecture
- Define and execute the multi-year cloud architecture strategy aligned to business growth, scalability, regulatory compliance, and cost optimization goals.
- Establish cloud architectural standards, reference architectures, and governance frameworks (landing zones, identity, network patterns, service catalog) and drive adoption across engineering.
- Guide cloud-native architecture decisions including containers/orchestration, IaaS/PaaS adoption, disaster recovery, and multi-region patterns with a steady eye on regulatory requirements (e.g., CIS, NIST).
- Oversee technology roadmaps and end-of-life planning for cloud platform components, ensuring forward-looking decisions balance innovation with operational stability.
- Serve as a key technical advisor to senior leadership, translating complex architectural trade-offs into clear business decisions.
- Metrics & Reporting
- Own the platform metrics and reporting function, establishing a consistent framework for measuring platform health, engineering velocity, reliability, and cost efficiency across CARE.
- Define and track KPIs aligned to internal SLAs, executive reporting needs, and audit/compliance requirements.
- Ensure Jira and other plat
Benefits
Health insuranceVision insurance
Additional Information
To be considered for this position, applications and resumes are accepted only through our careers site by directly applying to the posted job. We do not accept unsolicited resumes or sales solicitations from staffing agencies. Any OCC employee wishing to submit a referral must do so through their Workday account. Any resume submitted outside of an active job posting will not be considered for employment.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at theocc? Share your experience