Platform Engineering Manager
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
*Applicants must be authorized to work in the U.S. for any employer. *We cannot sponsor employment-based visas at this time. Let's Tango ! Where Innovation Meets Impact. At Tango Analytics, we're all about helping businesses make smarter decisions through powerful technology, insightful data, and a whole lot of collaboration. Whether you're a creative thinker, a strategic planner, a tech wizard, or a customer champion, there's a place for you on our team. We believe work should be meaningful and fun - so if you're ready to make a difference while enjoying the journey, come join us and let's Tango ! We are looking for a Platform Engineering Manager to join our dynamic and growing Platform Engineering team. About the Role: We are seeking a Platform Engineering Manager to build and operate our AI-native Internal Developer Platform (IDP)- the foundational layer that powers engineering velocity across the organization. You will own multi-cloud infrastructure (AWS & Azure), define golden paths, drive cloud modernization aligned to Well-Architected Frameworks, and deliver the observability, shared services, and agentic infrastructure that give every team a production-ready foundation. A defining dimension of this role is partnering with peer engineering leaders to actively migrate teams onto the platform and positioning it as the organization's AI-first engineering foundation. Key Responsibilities : Platform Strategy & Architecture Own and execute thePlatformroadmap: compute, networking, identity, observability, shared services, and AI/ML tooling across AWS and Azure Lead cloud modernization against the AWS and Azure Well-Architected Frameworks across all five pillars: operational excellence, security, reliability, performance efficiency, and cost optimization Define golden paths-standardized self-service workflows for service scaffolding, DB provisioning, environment spin-up, and AI workload deployment-with escape hatches for edge cases Own multi-cloud strategy; ensure consistent IAM, networking, and FinOps governance across providers IaC & CI/CD Automation DriveOpenTofu/Ansibleas source of truth for all infrastructure; enforce GitOps and policy-as-code for governance, auditability, and security Build and mature CI/CD pipelines (GitHub Actions, ArgoCD) to maximize deployment frequency, reduce lead time, and enable zero-ticket self-service provisioning Observability Own org-wide observability: metrics, logs, traces, and alerting-extended to AI/LLM signals (token usage, model latency, inference cost, agent task completion rates) Operate a centralized observability platform (Datadog/Signoz, OpenTelemetry, Grafana/Prometheus/Loki, or equivalent) delivered via golden paths; define SLIs/SLOs as onboarding defaults for all services Ensure full-stack coverage across infrastructure, Kubernetes, APM, distributed tracing, AI pipelines, and cost anomaly detection Shared Services Build and operate a self-service shared services catalog: secrets management, API gateways,model registries, and LLM gateways Rationalize duplicative per-team infrastructure; maintain shared services to production SLA standards with clear ownership and consistent security controls AI Platform & Agentic Infrastructure Own GPU/accelerated compute, model serving, vector databases, RAG pipelines, and LLM API gateway management (AWS Bedrock, Azure OpenAI, Anthropic) Build AI golden paths for self-service model deployment and LLM integration; design agentic infrastructure including orchestration runtimes, tool registries, memory/state services, and human-in-the-loop workflows Establish governance, cost controls, prompt injection guardrails, and model access policies for AI API usage and inference spend Partner with data science and ML engineering to translate agentic workflow requirements into reusable platform primitives Platform Adoption & Team Migration Collaborate onmigration program: partner with peer managers to plan and execute structured workload migrations onto the platform with hands-on support-not just documentation Define onboarding playbooks covering golden paths, shared services, observability setup, CI/CD cutover, and AI capability onboarding; track and report adoption metrics to leadership Identify and remove migration blockers-technical gaps, missing services, or organizational friction-and feed them into the platform roadmap Developer Experience, Leadership & Culture Build a self-service developer portal (Backstage, GitHubor equivalent) with service catalogs, golden paths, and AI/agentic workflow templates; track DORA metrics and developer experience KPIs Hire, develop, and retain high-performing platform engineers; build AI fluency across the team and foster a platform- as-a-product culture with feedback loops, OKRs, and iterative roadmapping Lead architecture reviews; make pragmatic build-vs-buy decisions; partner with security and compliance on governance priorities Security, Compliance & FinOps Embed secure
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Tango? Share your experience