Senior Manager, Site Reliability Engineering, Follow Up Boss
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
The FUB Infrastructure (i12e) owns the core platform that powers Follow Up Boss, including: FUB application infrastructure across dev, QA, and production AWS accounts Legacy Partner Development infrastructure and critical shared services Observability, monitoring, and cost management for FUB workloads Incident handling and communication, including on-call, response, and follow-through Developer experience tooling: dev environments, onboarding, deployment, CI/CD, and automation FUB security posture: audits, compliance, app sec, tooling, and policy in partnership with ZG security teams The team partners closely with central Zillow platform orgs (database, networking, security, ZGCP/TE) and the broader FUB product engineering org to keep the system reliable, scalable, secure, and cost-effective while unblocking developer velocity. As a Senior Engineering Manager (M4) for FUB Infrastructure (SRE), you will lead a multidisciplinary team of SREs, SDEs, and security engineers responsible for the infrastructure, reliability, and developer experience that underpin Follow Up Boss. You will: Own team execution, quality, and innovation across multiple systems and workloads that support the entire FUB+ org. Move the infra team into a strong posture for proactive, roadmap-driven investments in reliability, scalability, security, and developer productivity. Drive cross-org alignment and strategy with ZG platform teams (ZGCP, SRE, networking, database, security) as well as FUB product teams to modernize FUB infrastructure and adopt shared platform capabilities where appropriate. This is an M4 scope role: you will be expected to consistently deliver through others (including senior ICs who do not report to you), shape technical strategy beyond your immediate team, and operate with limited oversight. Key Responsibilities (M4 Expectations + FUB Infra Needs) Team Execution & Roadmap Own execution for the FUB infra & security roadmap, turning strategic goals (e.g., DB scalability, ZGCP adoption, infra cost and reliability targets) into a sequenced, realistic plan with clear milestones and measures of success. Run an exemplary planning and delivery rhythm (quarterly), including estimation, risk management, dependency mapping, and stakeholder updates across FUB+ and central platform teams. Ensure the team hits commitments with rare surprises, and when risk emerges, proactively engage partners to adjust scope, resources, or timeline with clear communication and tradeoffs. Reliability, Quality & On-Call Be accountable for reliability, performance, operability, and cost of core FUB services and infrastructure (EC2, RDS/Aurora, Redis/Valkey, networking, queues, SRE tooling). Lead the team to run a proud, low-toil on-call process: well-defined SLOs and error budgets, actionable alerting, fast incident detection/response, high-quality RCAs, and follow-through on remediation work. Drive urgent, sustained progress on database scaling and performance, including capacity management, query and schema optimization, and modernization of data infrastructure. Infrastructure Modernization & Platform Strategy Lead the FUB modernization strategy and execution for prioritized workloads (e.g., workers, supporting services), balancing devex wins, reliability, and risk while coordinating with central teams. Partner with principal/staff engineers to refine FUB's service scaling strategy, ensuring clear guidance on when teams build in the monolith vs. new services, and how infra supports these choices. Developer Experience & Environments Raise the bar on developer environments and onboarding, reducing friction from dev boxes, tooling setup, and infra access; ensure new engineers can be productive quickly with reliable, self-service workflows. Drive faster, safer deployments by improving CI/CD (GitLab, pipelines, AMI replacements, canary/progressive delivery) and aligning with ZG best practices for trunk-based development and feature flags. Partner with product SDMs and tech leads to lower operational friction for dev teams (e.g., better runbooks, improved observability, easier infra integrations, automated guardrails and guardrails-powered AI tooling). Team Building, Coaching & Talent Lead and grow a high-performing, inclusive SRE/infrastructure/security team, set clear expectations, provide candid feedback, and manage performance. Develop technical leaders within and adjacent to the team (SREs, SDEs, security engineers, P5 ICs) through sponsorship, delegation, and stretch opportunities that expand impact beyond the immediate team. Hire, retain, and onboard talent across SRE, infra SDE, ensuring skills match the breadth of FUB infra (AWS, Terraform/Ansible, Kubernetes/ZGCP, observability, security, databases). Cross-Org Alignment & Strategy Be the primary technical and operational interface for FUB infra with FUB+ leadership and central Zillow platform orgs, driving alignment on priorities, tradeoffs, a