Senior/Staff ML Engineer, Performance Optimization

Comfy-org · San Francisco
Full-time · On-site · Posted 12mo ago
Skills: PyTorch

About the role

We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible.

You are a good fit if this describes you:

  • You geek out about model inference, torch optimizations, and memory management
  • You've written production PyTorch code that pushes performance boundaries
  • You love diving deep into how models actually work under the hood
  • You get excited about making insanely optimized code that just works
  • You think the current state of ML deployment could be way better

Responsibilities

  • Build and optimize the core inference engine that powers ComfyUI
  • Make massive models run faster and use less memory than anyone else
  • Work directly with our core team on architecting new features
  • Tackle the hardest technical problems in the visual AI space
  • Help shape where we take this technology next
  • Bonus: If you've worked with diffusion models or LLMs before, or built custom nodes for ComfyUI, that's awesome

Benefits

Performance bonus
