Skip to main content
Back to jobs

Media Software Engineer, Speech (All Levels)

External
cantina logoCantina · San Francisco
$120K–$180K/yrFull-timeRemote6mo ago
AndroidiOSMachine LearningNode.js
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

The Media Team at Cantina is building the real-time infrastructure powering live conversations between people and AI. Our goal is simple but technically challenging: make interacting with AI feel fast, natural, and truly conversational. We're looking for a Software Engineer to help improve the speech, audio, and media systems at the heart of the Cantina experience. A major focus of this role is reducing latency and improving responsiveness so AI bots can hear users, process intent, and respond in real time - without awkward pauses or delays. This team works across everything from low-level media pipelines and WebRTC frameworks to globally distributed infrastructure supporting real-time voice and video interactions across iOS, Android, and web. If you're excited by high-performance C++, real-time systems, speech technologies, and building the future of conversational AI, we'd love to talk.

Responsibilities

  • Improve the real-time speech and media systems powering live AI conversations.
  • Reduce latency and optimize responsiveness across audio streaming and speech pipelines.
  • Build new voice and video capabilities that enable more immersive interactions between users and AI bots.
  • Improve and extend our custom WebRTC infrastructure across iOS, Android, and web.
  • Work closely with product and platform teams to shape the future of conversational AI experiences.
  • What You'll Bring: We welcome applicants across a wide range of experience levels, from new graduates to senior engineers. Responsibilities and leveling will be tailored to match the candidate's background.
  • These are the minimum qualifications:
  • BS or MS in Computer Science, Computer Engineering, or a related field; or equivalent experience.
  • Excellent communications skills.
  • Experience with C or C++.
  • Strong computer science fundamentals, including familiarity with data structures and concurrent / multithreaded programming.
  • Exposure to system programming concepts, including network protocols; memory management; and distributed systems fundamentals.
  • Object-oriented programming and design skills.
  • Interest in solving challenging, subtle engineering problems.
  • These are the preferred qualifications:
  • Previous experience with WebRTC, streaming protocols, or other media-related technologies.
  • Familiarity with audio or video processing techniques and algorithms.
  • Experience creating backend server infrastructure.
  • Experience developing software for iOS and Android.
  • Familiarity with building services using Node.js.
  • Familiarity with artificial intelligence and machine learning techniques, particularly in relation to speech recognition and synthesis.
  • Location:
  • While we offer fully remote and hybrid employment opportunities, our Media Engineering team strongly desires candidates to be available (or willing to relocate) to work in the Bay Area. For reference, 95% of the Media Engineering team works from the Bay Area.

Benefits

The anticipated annual base salary range for this role is between $120,000-$180,000. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.Competitive salary and generous company equityMedical, dental, and vision insurance - 99.99% of premiums covered by Cantina42 days of paid time off, including:15 PTO days10 sick days15 company holidays2 floating holidaysGenerous parental leave & fertility support401(k) retirement savings planLifestyle spending account - $500/month to use however you'd likeComplimentary lunch and snacks for in-office employeesOne Medical membership, and more!Dental insuranceVision insurance401(k)Paid time offRemote work optionsEquity / stock optionsParental leave

Additional Information

About Cantina: Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning. If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at cantina? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect