Senior/Staff Applied GenAI Researcher - Enterprise Outcome Team

TrueFoundry · San Mateo, San Francisco Bay Area

Full-time · On-site · 4mo ago

CI/CD · Deep Learning · Docker · gRPC · Kubernetes · Leadership

Responsibilities

Build and productionize LLM-based and ML-based solutions, utilizing both open-source and proprietary models
Integrate TrueFoundry's platform seamlessly into customer environments and use it to shorten the time to value of these applications
Build agents, write prompts and eval sets, and optimize inference time and response quality for applications
Write maintainable, production-quality, high-performance code, frequently in Python
Build and optimize REST APIs, gRPC services, and data pipelines
Drive rapid feedback loops from customer deployments into continuous improvements for product and platform
Participate in solution architecture design, code reviews, and adoption of engineering best practices

Requirements

4+ years of experience building and deploying ML applications in production
4+ years of experience writing production code in Python
2+ years working in deep learning and natural language processing
1+ year of experience building agentic applications and GenAI apps
Experience building REST APIs, working with Docker, and setting up CI/CD pipelines
Deep familiarity with PyTorch and Hugging Face libraries
Working knowledge of model-serving tools like vLLM, Triton, and TensorRT is preferred
Understanding of Kubernetes, distributed systems architecture, and cloud-native technologies is preferred
Strong system design abilities, with a focus on modular, reliable, and scalable architecture
Passionate about applying AI to solve real-world, cross-industry problems
Familiarity with LLM fine-tuning, RAG (Retrieval-Augmented Generation), prompt engineering, or evaluation frameworks

Why Join TrueFoundry

Build foundational Applied GenAI solutions alongside world-class engineers (ex-Facebook Infrastructure leaders)
Work on real-world, high-impact problems across multiple industries
Collaborate directly with founders and early leadership on shaping company and product direction
Enjoy a flexible, ownership-driven work environment with rapid career growth
Weekly learning sessions, team-building activities, and startup mentorship opportunities
Learning credits and resources to help you grow your technical and professional skills

Benefits

Flexible schedule

Additional Information

About TrueFoundry

Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.

That infrastructure layer is being built right now. We're TrueFoundry, and we're building it. We're looking for a Senior/Staff Applied GenAI Researcher - Enterprise Outcome Team to join the team.

The Problem We're Solving

Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.

The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready. You need a control plane that handles:

Intelligent routing with observability, cost policies, and fallback logic
Centralized tool and MCP server management with security and lifecycle controls
Agent orchestration with governance and guardrails
A unified compute layer to run self-hosted models, custom tools, and agents

We've built two products to solve this:

AI Gateway is the control plane: five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.
AI Deploy is the compute layer: a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.

We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.

Interested in this role?

Apply on the company's website.
