Skip to main content
Back to jobs

Lead AI/ML Engineer

External
toyota logoToyota · Plano, TX
Full-timeOn-site2w ago
AWSCI/CDCloudFormationComplianceDockerDocumentation
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for talented team members who want to Dream. Do. Grow. with us. An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company- delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create best-in-class customer experience in an innovative, collaborative environment. To save time applying, Toyota does not offer sponsorship of job applicants for employment-based visas or any other work authorization for this position at this time . This position is based in Plano, TX. Who We're Looking For: At TFS, we're embarking on a technology transformation journey, creating next generation products and platforms. These products enable TFS to provide a best-in-class experience to our customers and partners and position us to rapidly scale to realize our vision of mobility for all by enabling freedom of movement for everyone. We are seeking a Lead GenAI D eveloper to d esign and lead cloud-native architectures on AWS that power GenAI applications and LLM-based systems . The Lead GenAI Developer will build ingestion pipelines, integrate vector databases, operate MCP servers and serverless components, and drive prompt engineering and production LLM usage. The expectation is as a Lead you will be hands-on and driving the delivery of critical business initiative s .

Responsibilities

  • Cloud Architecture & Infrastructure: Design scalable, secure AWS architectures (VPCs, IAM, networking, S3, EKS/ECS/Fargate, Lambda) and manage infrastructure using IaC (Terraform/CloudFormation), CI/CD, and observability tooling.
  • LLM & GenAI Platforms: Lead integration of API-based and self-hosted LLMs, implement RAG solutions, optimize inference cost/performance, and operate MCP servers and model-serving infrastructure.
  • Prompting & Evaluation: Develop prompt engineering strategies, reusable templates, and evaluation frameworks; collaborate with product teams to iterate and improve prompt quality.
  • Vector Databases & Retrieval Pipelines: Implement and maintain vector stores (OpenSearch, Pinecone, Milvus, Qdrant ) and design efficient similarity search, retrieval workflows, and indexing strategies.
  • Data Ingestion & Processing Pipelines: Build robust ETL/ELT and document ingestion systems-batch and streaming-including data transformation, cleaning, metadata extraction, and embedding generation.
  • Microservices & Serverless Systems: Develop microservices and serverless components to support low-latency inference, asynchronous processing, and event-driven architectures.
  • Python Development & AI Tooling: Build core backend components in Python, leveraging LangChain , LlamaIndex , Hugging Face, and related tooling.
  • Security, Governance & Cross-Functional Leadership: Enforce data privacy, secrets management, RBAC, and cost governance best practices while collaborating across teams, mentoring engineers, and maintaining operational runbooks.
  • What You Bring:
  • Bachelor's Degree and/ or equivalent experience.
  • 6+ years cloud architecture experience, including 3+ years building production GenAI/LLM systems on AWS.
  • Strong Python and AWS expertise , including Lambda, ECS/EKS, S3, SageMaker (or similar), plus Docker and Kubernetes.
  • Production experience with vector databases and designing ingestion + embedding pipelines for both batch and streaming workloads.
  • Hands-on with prompt design, evaluation, LLM orchestration, and RAG implementation patterns.
  • Experience deploying and operating model- serving or MCP - like server infrastructure (selfhosted or managed).
  • Proficient with IaC and delivery tooling, including Terraform/CloudFormation, GitOps , and CI pipelines.
  • Proven skills in monitoring, logging, and automated testing for ML infrastructure and services.
  • Strong communicator with clear documentation skills.
  • Added Bonus If You Have:
  • Experience with model-serving infrastructure, such as Amazon SageMaker, NVIDIA Triton, Ray Serve, or similar platforms.
  • Hands-on experience with GenAI libraries and frameworks, including LangChain , LlamaIndex , Hugging Face, and OpenAI APIs.
  • Deep operational expertise with vector databases, such as Pinecone, Milvus, Weaviate , or Qdrant .
  • Knowledge of data governance and regulatory compliance, including HIPAA and GDPR considerations.
  • AWS Solutions Architect, AWS DevOps Engineer, or equivalent industry certifications.
  • What We'll Bring
  • During your interview process, our team can fill you in on all the details of our industry-leading benefits and

Benefits

Vision insurancePerformance bonus

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at toyota? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect