Skip to main content
Back to jobs

Member of Technical Staff - Large Scale Data Infrastructure

External
blackforestlabs logoBlackforestlabs · Freiburg (germany), San Francisco (usa)
$180K–$300K/yrFull-timeOn-site2mo ago30+ days old, may be filled
AzureKubernetesPythonPyTorch
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Scalable data loaders for training runs across thousands of GPUs
  • Efficient storage and retrieval systems for petabyte-scale datasets
  • Multi-cloud object storage abstraction
  • Execute large-scale data migrations across storage systems and providers
  • Debug and resolve performance bottlenecks in distributed data loading
  • Technical Focus
  • Python, PyTorch DataLoader internals
  • Object storage (e.g. S3, Azure Blob, GCS)
  • Parquet for metadata
  • Video: ffmpeg, PyAV, codec fundamentals

Requirements

  • Built and operated data pipelines at petabyte scale
  • Optimized data loading
  • Worked with petabyte-scale video and image datasets
  • Written processing jobs operating on millions of files
  • Debugged distributed system bottlenecks across large fleets of machines
  • Experience streaming dataset formats (e.g. WebDataset)
  • Video codec internals and frame-accurate seeking
  • Distributed systems experience
  • Slurm and Kubernetes for job orchestration
  • Experience with object storage performance tuning across providers
  • How We Work Together
  • Everything we do is grounded in four values:
  • Obsessed. We are a frontier research lab. The science has to be right, the understanding deep, the product beautiful.
  • Low Ego. The work speaks. The best idea wins, no matter who said it. Credit is shared. Nobody is above any task.
  • Bold. We take the ambitious bet. We ship, we do not wait for conditions to be perfect.
  • Kind. People over politics. We treat each other with genuine warmth. Agency without empathy creates chaos.
  • If this sounds like work you'd enjoy, we'd love to hear from you.
  • Base Annual Salary (SF based role) : $180,000-$300,000 USD + Equity

Benefits

Remote work optionsEquity / stock options

Additional Information

About Black Forest Labs We're the team behind Latent Diffusion, Stable Diffusion, and FLUX-foundational technologies that changed how the world creates images and video. We're creating the generative models that power how people make images and video-tools used by millions of creators, developers, and businesses worldwide. Our FLUX models are among the most advanced in the world, and we're just getting started. Headquartered in Freiburg, Germany with a growing presence in San Francisco, we're scaling fast while staying true to what makes us different: research excellence, open science, and building technology that expands human creativity. Why This Role We're looking for infrastructure engineers who want to work at peta-to-exabyte scale. You'll build the data systems behind the largest training runs on thousands of GPUs, where fixing one bottleneck lets researchers train the next breakthrough model.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at blackforestlabs? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect
Member of Technical Staff - Large Scale Data Infrastructure at Blackforestlabs