Member of Technical Staff, ML Performance
External · Full-time · On-site · 3mo ago
Machine Learning · Performance Optimization · PyTorch · Robotics
About the role
Odyssey is an AI lab pioneering general-purpose world models: causal, multimodal systems that learn to predict and interact with the world over long horizons, while generating real-time, interactive simulations from any starting point. This foundational technology promises to revolutionize robotics, science, healthcare, education, gaming, defense, and beyond.
Responsibilities
- Optimize models that will be used in real time by hundreds of thousands of users.
- Design and implement distributed training strategies to reduce training time and resource consumption on large GPU clusters.
- Partner with our elite team of ML researchers and engineers to ensure model architectures are highly performant from conception.
- Develop sophisticated tools to identify performance bottlenecks and stability issues in both training and serving environments.
- Pioneer innovative approaches, frameworks, and system designs that enhance performance metrics across our model development and inference infrastructure.
- Have significant autonomy in technical decisions.
- Use the latest-generation GPUs.
Requirements
- 8+ years of software engineering experience, with significant work in ML performance.
- Deep insight into modern machine learning architectures with a natural instinct for performance optimization, particularly distributed training and inference.
- Track record of owning projects end to end.
- Problem-solving mindset with the ability to acquire new skills as needed.
- Proficiency with PyTorch (or TF/JAX) and Triton as well as NVIDIA GPU ecosystems and optimization stacks.
- Highly metrics-driven.
Benefits
Health insurance