Skip to main content
Back to jobs

Multimodal AI Systems Architect (AI Engineering)

External
hyphenconnect logoHyphenconnect · Hong Kong
Full-timeOn-site1mo ago
RAG
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Integrate vision encoders and audio-native models into core agent reasoning loops.
  • Optimize streaming latency for voice-to-voice AI interactions.
  • Architect multimodal RAG systems capable of retrieving insights from videos and PDFs.

Requirements

  • Experience with Whisper, CLIP, and multimodal LLM integration.
  • Knowledge of streaming architectures and WebRTC.
  • Expertise in cross-modal alignment.

Benefits

Vision insurance

Additional Information

We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-to-voice interactions and multimodal retrieval capabilities, ensuring our systems are efficient and innovative.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at hyphenconnect? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect