Skip to main content
Back to jobs

Member of Technical Staff - Compilers

External
gimlet logoGimlet · San Francisco
Full-timeOn-site3mo ago
Python
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Gimlet is building the next generation of AI infrastructure: large-scale AI datacenters and the orchestration platform that coordinates them. The future of AI will require vastly more compute than exists today. But as AI workloads become more complex and new hardware architectures emerge, simply deploying more GPUs isn't enough. The challenge is making increasingly diverse compute work together. Gimlet's platform intelligently partitions and routes workloads across heterogeneous hardware, enabling step-function improvements in performance and efficiency. Customers deploy through production-grade APIs without needing to think about hardware selection, placement, or optimization. We work with foundation labs, hyperscalers, and AI-native companies to power production workloads at massive scale and help define the infrastructure layer for the future of AI. Gimlet Labs is seeking a Member of Technical Staff focused on compiler infrastructure for ML execution systems, spanning IR transformations, runtime systems, kernel orchestration, scheduling, and serving optimization. You will help build the execution stack that transforms modern AI workloads into efficient programs running across heterogeneous hardware. The work spans runtime systems, compiler infrastructure, scheduling, memory movement, kernel orchestration, and serving optimization for large-scale inference workloads. This is not a traditional language compiler or backend code generation role. We are looking for engineers who think deeply about execution behavior: IR transformations, runtime optimization, scheduling, memory locality, kernel composition, distributed execution, and heterogeneous serving infrastructure. https://gimletlabs.ai/blog/low-latency-spec-decode-corsair

Responsibilities

  • Design and implement compiler and runtime pipelines for large-scale AI inference workloads
  • Build and evolve IR transformations, lowering passes, and execution optimizations across graph, tensor, and kernel representations
  • Optimize execution for latency, throughput, memory efficiency, and heterogeneous hardware utilization
  • Develop scheduling, partitioning, and kernel orchestration strategies across accelerators and serving runtimes
  • Work on execution systems spanning compiler infrastructure, runtime behavior, memory movement, and kernel dispatch
  • Integrate new model architectures, execution patterns, and serving optimizations into the stack
  • Collaborate closely with systems, runtime, and kernel engineers to ensure correctness and performance across the full execution pipeline
  • You may be a good fit if
  • Strong systems and performance engineering fundamentals
  • Experience building compiler systems, compiler-adjacent infrastructure, or execution/runtime systems
  • Experience implementing IR transformations, compiler passes, lowering logic, or code generation systems
  • Ability to reason about execution behavior, memory systems, scheduling, and hardware efficiency
  • Strong software engineering skills in C++ and/or Python
  • Strong candidates may also have
  • Experience with MLIR, LLVM, XLA, TVM, Triton, or similar compiler/runtime infrastructure
  • Experience optimizing ML inference or serving workloads
  • Familiarity with runtime systems, kernel dispatch, launch APIs, or memory allocators
  • Experience working with GPUs, AI accelerators, or heterogeneous hardware systems
  • Experience profiling and debugging performance-critical systems
  • Familiarity with scheduling, partitioning, or kernel-level optimizations
  • What Makes Gimlet Different
  • As an early member of the team, you will have significant ownership, work alongside highly technical engineers, and help shape both the systems we build and how we scale the company.
  • We value people who are excited to work across domains, take ownership of meaningful problems, and build technology that enables the next generation of AI.

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at gimlet? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect