Skip to main content
Back to jobs

Senior Applied AI Engineer - Multimodal Transformers

External
kodiak logoKodiak · San Francisco Bay Area
$200K–$260K/yrFull-timeOn-site1mo ago
Deep LearningPythonPyTorchRoboticsSAFeTensorFlow
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Requirements

  • BS, MS, or PhD in AI, Computer Science, or a related field
  • 4+ years experience with transformer architectures, particularly in multimodal or multi-stream settings
  • Familiarity with cross-attention, token fusion, or modality alignment techniques
  • Proficiency in Python and deep learning frameworks like PyTorch or TensorFlow
  • Strong understanding of scalable training for large models, including distributed training and mixed-precision optimization
  • Passion for building AI that reasons over the full breadth of sensory input to operate safely in the real world

Benefits

Competitive compensation package including equity and annual bonusesExcellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance, & Critical Illness InsuranceFlexible PTO, 10 paid holidays, and generous parental leave policiesOur office is centrally located in Mountain View, CAOffice perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV chargingLong Term Disability, Short Term Disability, Life InsuranceWellbeing Benefits - Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation)Fidelity 401(k)Commuter, FSA, Dependent Care FSA, HSAVarious incentive programs (referral bonuses, patent bonuses, etc.)California Pay Range$200,000 - $260,000 USDHealth insuranceDental insuranceVision insurance401(k)Paid time offFlexible scheduleEquity / stock optionsPerformance bonusParental leave

Additional Information

Kodiak Robotics, Inc. was founded in 2018 and has become a leader in autonomous ground transportation committed to a safer and more efficient future for all. The company has developed an artificial intelligence (AI) powered technology stack purpose-built for commercial trucking and the public sector. The company delivers freight daily for its customers across the southern United States using its autonomous technology. In 2024, Kodiak became the first known company to publicly announce delivering a driverless semi-truck to a customer. Kodiak is also leveraging its commercial self-driving software to develop, test and deploy autonomous capabilities for the U.S. Department of Defense. Kodiak's autonomy stack is built on AI that fuses diverse sensor streams into a unified, actionable understanding of the world. We are developing GigaFusionNet - a large-scale multimodal transformer that learns rich, joint representations across camera, LiDAR, and radar through attention-based fusion. We are looking for engineers to push the boundaries of how transformer architectures combine and reason over heterogeneous sensor data.This role is open to all levels - from those eager to contribute to cutting-edge research to experts driving innovation at scale. In this role, you will: Design and develop multimodal transformer architectures that fuse camera, LiDAR, and radar into unified representations Research and implement cross-modal attention mechanisms, token fusion strategies, and efficient multi-stream tokenization Build scalable training pipelines for large-scale multimodal transformers across massive real-world datasets Explore self-supervised and contrastive pretraining objectives that learn transferable multimodal representations Optimize transformer models for real-time inference under latency and compute constraints


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at kodiak? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect