Sr. Technical Program Manager (TPM)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
As a Senior Infrastructure Technical Program Manager (TPM) at Together AI, you will be at the core of building, optimizing, and scaling the global GPU resources needed for a pioneering AI infrastructure company. Your role is crucial in ensuring that the backbone of our AI models, thousands of GPUs distributed around the world, operates efficiently and reliably, enabling cutting-edge AI advancements that democratize access to AI technology globally. You will drive cross-functional excellence by streamlining critical workflows and enhancing communication across internal and external teams. Join top engineers, researchers, and innovators to shape the future of AI infrastructure and power the next generation of AI-driven solutions.
Responsibilities
- End-to-End Product Ownership: Own a comprehensive product roadmap, detailing key features, enhancements, and releases. Drive end-to-end product development, manage development and testing, and lead launches.
- Stakeholder Engagement : Engage with stakeholders to understand their needs, pain points, and feedback. Drive initiatives to enhance customer satisfaction and loyalty through product improvements and innovative solutions.
- Cross-Functional Execution: Lead and align diverse cross-functional teams - including Research, Engineering, DevOps, SRE, and Go-to-Market - to ensure seamless project delivery and organizational success.
Requirements
- ML Product or Infrastructure Experience : 5+ years of experience building and scaling AI/ML-powered products and infrastructure, specifically collaborating with research and engineering teams.
- Proven experience with large-scale technology deployments, including cloud computing platforms, decentralized cloud infrastructure, and distributed systems (e.g., containerization and orchestration tools).
- Familiarity with the technical domains of Observability, Storage, Network Engineering, and Security for infrastructure.Experience with cloud computing platforms, decentralized cloud infrastructure, and/or similar large-scale technology deployments.
- Familiarity with cloud-based technologies (e.g., AWS, Google Cloud, or Azure)
- Technical Foundation: Bachelor's or Master's degree in Machine Learning, Computer Science, Engineering, or a related field.
- Exceptional analytical and problem-solving skills, with a demonstrated ability to identify and proactively mitigate technical risks
- Experience using AI tools, such as ClaudeCode or similar, to accelerate analytical progress.
- Executive and Organizational Acumen
- Proven ability to thrive in a fast-paced, ambiguous startup environment, prioritizing complex tasks and managing multiple simultaneous projects.
- Strong organizational abilities to build cross-functional alignment and establish clear, focused priorities.
- A proactive and collaborative team-oriented approach, demonstrating a willingness to drive necessary outcomes across the company.
- Excellent communication and program management skills for effective collaboration with both internal stakeholders and external vendors.
- About Together AI
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Together AI? Share your experience