Skip to main content
Back to jobs

Principal Research & Engineering, Realtime Voice AI

External
Inflection AI logoInflection Ai · Palo Alto, CA
$400K–$550K/yrFull-timeOn-siteToday
LLMsSAFe
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Voice is becoming the highest-stakes interface for AI, where quality depends on speed, naturalness, interruption handling, emotional nuance, and reliability in real-world conditions. We are looking for a hands-on technical leader to define and build Inflection's realtime Voice AI stack across speech models, streaming systems, voice-agent runtime, and evaluation. This person will help shape how emotionally intelligent AI shows up in spoken interactions, partnering across research, engineering, product, and design to deliver voice agents that feel responsive, trustworthy, and useful in enterprise settings.

Responsibilities

  • Establish the technical roadmap for Inflection's realtime Voice AI stack, encompassing streaming ASR, TTS, speech-to-speech, speech LLMs, turn-taking, barge-in, latency, and reliability.
  • Utilize a 1,000 GPU cluster to support performance benchmarking and extensive experimentation.
  • Determine build-vs-buy-vs-train strategies for core audio, speech, and realtime interaction components.
  • Direct research and engineering efforts focused on speech quality, naturalness, expressiveness, emotional fit, controllability, and production readiness.
  • Collaborate with infrastructure, product, design, and agentic AI teams to deploy voice agents for enterprise workflows.
  • Develop evaluation systems measuring voice quality through metrics such as clarity, emotional appropriateness, interruption handling, task success, user preference, latency, and reliability, moving beyond standard WER.
  • Refine production voice behavior by debugging across runtime, model, evaluation, data, and product layers.
  • Mentor, and Coach a team specializing in speech research, audio infrastructure, realtime systems, and evaluation.

Requirements

  • Experience leading or serving as a principal Research and Engineering contributor to realtime voice, speech, audio AI, or conversational AI systems in production.
  • Experience with one or more of: streaming ASR, TTS, speech-to-speech systems, speech LLMs, audio tokenization, multimodal models, barge-in, low-latency inference, or realtime agents.
  • Strong technical judgment across both speech modeling and production systems.
  • Ability to define voice quality in terms of user and customer outcomes, not only offline model metrics.
  • Experience designing or using evaluation systems that capture real user experience.
  • Strong product intuition for natural, trustworthy, emotionally appropriate voice interactions.
  • Ability to lead senior technical talent while staying close to the code, architecture, and debugging work.
  • Have a bachelor's degree or equivalent in a related field to the offered position requirements
  • Employee Pay Disclosures

Benefits

Inflection AI values and supports our team's mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include:Diverse medical, dental and vision options401k matching programUnlimited paid time offParental leave and flexibility for all parents and caregiversSupport of country-specific visa needs for international employees living in the Bay AreaHealth insuranceDental insuranceVision insurance401(k)Equity / stock optionsParental leave

Additional Information

About Inflection AI Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We're shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people's potential. Inflection AI created Pi, the world's first emotionally intelligent AI, to help people work through decisions, emotions, and challenges. Pi is a personal AI agent powered by Inflection AI's foundation model, proving that AI can be personal, empathetic, and contextually aware.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Inflection AI? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect