Skip to main content
Back to jobs

Lead Audio ML Engineer

External
Hark logoHark · San Jose
Full-timeOn-site1d ago
Deep LearningPyTorchTensorFlow
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

We are looking for an Lead Audio ML Engineer to implement, train, and ship audio models that run on-device across our consumer products. This role spans the full lifecycle of on-device audio intelligence: model design, training, evaluation, and deployment to constrained hardware. You will work alongside our DSP, firmware, and product teams to turn audio model research into production features that ship at scale.

Responsibilities

  • Implement and train audio models for wake-word detection, voice activity detection, source separation, speech enhancement and similar audio
  • Take models from research prototype to on-device deployment within latency, memory, and power budgets
  • Build and maintain training data pipelines, evaluation harnesses, and re-training cadence across model families
  • Partner with DSP and firmware engineers to integrate models into the Hark Audio Engine and DSP runtime
  • Collaborate with hardware and acoustics teams to characterize the signal conditions models must operate under
  • Profile and optimize models on target platforms (DSP, NPU, CPU) and define accuracy and resource budgets per product

Requirements

  • 3+ years of professional experience building and shipping audio or speech ML models
  • Strong fluency in PyTorch or TensorFlow and modern audio deep learning toolchains
  • Hands-on experience deploying models to embedded targets such as DSP, NPU, or mobile NPU and CPU
  • Comfort working across the full ML lifecycle: data, training, evaluation, deployment, and monitoring
  • Solid foundation in audio signal processing concepts and how they intersect with ML pipelines
  • Experience collaborating with DSP, firmware, and hardware engineers on resource-constrained systems
  • Bonus Qualifications
  • Background shipping voice-first or far-field audio products
  • Experience with on-device wake-word, ASR front-ends, or speech enhancement at production scale
  • Familiarity with model compression techniques such as quantization, pruning, and distillation
  • Familiarity with Qualcomm AI stacks or similar alternatives from other providers
  • Open-source contributions to audio ML projects

Benefits

The US base salary range for this full-time position is between $120,000 and $300,000 annually.Vision insurancePerformance bonus

Additional Information

About Hark Hark is an artificial intelligence company building advanced, personalized intelligence. One that is proactive, multimodal, and capable of interacting with the world through speech, text, vision, and persistent memory. We're pairing that intelligence with next-generation hardware to create a universal interface between humans and machines. While today's AI largely operates through chat boxes and decade-old devices, Hark is focused on what comes next: agentic systems that interact naturally with people and the real world. To get there, we're developing multimodal models and next-generation AI hardware together - designed from the ground up as a single, unified interface for a new era of intelligent systems.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Hark? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect