Member of Technical Staff - ML Training Systems

Modal · NY
Full-time · On-site · 3mo ago
Linux · Machine Learning · Seaborn

About the role

AI needs a new infrastructure layer. We're building it at Modal. Every era of computing brought new workloads that previous infrastructure couldn't support: mainframes, databases, and the cloud. Each time, the company that rebuilt the layer underneath defined the decade. AI is no different, except it touches everything instead of one slice, and the window to build the layer underneath it is open right now.

Our customers include category-defining companies like Lovable, Ramp, Cognition, DoorDash, and Suno. They rely on Modal for instant GPU access, sub-second container starts, and native storage, so it's simple to serve low-latency inference, fine-tune models, and access production-ready sandboxes at scale.

We recently raised a $355M Series C at a $4.65B valuation, led by General Catalyst and Redpoint Ventures. We've crossed $300M+ ARR and grown fivefold since September. Our team includes creators of popular open-source projects (e.g., Seaborn, Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.

We are looking for strong engineers with experience training production machine learning models. If you are interested in contributing to open-source projects and evolving Modal's infrastructure to train the next generation of language models, we'd love to hear from you!

Requirements

  • 5+ years of experience writing high-quality, high-performance code.
  • Experience working with torch and high-level training frameworks (Hugging Face, verl, slime).
  • Experience with ML training optimization (tell us a story about eliminating data loading bottlenecks, overlapping communication with compute, rewriting a trainer to handle off-policy rollouts, etc.).
  • Nice-to-have: familiarity with low-level operating system foundations (Linux kernel, file systems, containers, etc.).
  • Ability to work in person in our NYC or San Francisco office.
  • Ability to participate in an on-call rotation and respond to production incidents.

Interested in this role?

Apply on the company's website.
