Senior Staff Research Engineer - Reinforcement Learning for AI Agents

Xpengmotors · Santa Clara, CA
$244K–$413K/yr · Full-time · On-site · 3w ago
Machine Learning · Python · PyTorch · Reinforcement Learning · Robotics

Responsibilities

  • Reinforcement learning methods for LLM-driven agents and decision systems.
  • Policy optimization for long-horizon reasoning and planning.
  • Learning from human or AI feedback (RLHF / RLAIF).
  • Agent training pipelines built on top of our agent infrastructure platform.
  • Evaluation and benchmarking systems for agent capabilities.
  • Learning loops that integrate real-world and simulation data.
  • Contribute to AI systems that continuously improve after deployment.

Requirements

  • MS or PhD in Computer Science, AI, Machine Learning, Robotics, or a related field.
  • Strong background in reinforcement learning or machine learning.
  • Experience implementing RL algorithms such as PPO, Actor-Critic, or policy gradient methods.
  • Strong programming skills in Python with PyTorch or JAX.
  • Experience building ML training systems or infrastructure.
  • Experience with RLHF or preference learning.
  • Experience with LLM agents or tool-using AI systems.
  • Experience with multi-agent systems or long-horizon planning.
  • Experience with simulation environments for RL.
  • Publications in NeurIPS, ICML, ICLR, ACL, or related venues.

What we provide

  • A fun, supportive and engaging environment.
  • Opportunity to make a significant impact on the transportation revolution by advancing autonomous driving.
  • Opportunity to work on cutting-edge technologies with the top talent in the field.
  • Competitive compensation package.
  • Snacks, lunches and fun activities.

Benefits

  • Equity / stock options
  • Performance bonus

Additional Information

XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity.

We are looking for exceptional Research Engineers / Scientists to design learning systems that allow agents to plan over long horizons, learn effective strategies, and improve through experience. This role sits at the intersection of reinforcement learning, large language models, and real-world autonomous systems.

Autonomous systems must operate reliably in complex, dynamic environments. We believe the next generation of autonomy will involve learning agents that continuously improve through interaction, feedback, and large-scale data. You will help build the learning systems that power these agents.


