Research Intern - Applied Reinforcement Learning
About the role
PhD Research Intern - Applied Reinforcement Learning
Centific AI Research

Role Summary
Centific AI Research seeks a PhD Research Intern to design and evaluate reinforcement learning (RL) systems for agentic AI workflows. You will develop RL environments, reward models, and post-training pipelines for LLM-based agents, translating research into practical enterprise solutions.

Scope of Work
- End-to-end RL pipelines for agentic systems (simulation → training → evaluation)
- Alignment of LLM-based agents using RLHF, DPO, PPO, and emerging methods
- Design of reward functions, verifiers, and evaluation frameworks
- Simulation environments (digital twins) for enterprise workflows
- Scalable training and inference for RL-based systems

Example Projects
- Build a custom RL environment simulating a real-world enterprise workflow and train an agent using PPO or GRPO
- Develop a reward modeling pipeline from human feedback and evaluate alignment improvements
- Create an evaluation harness measuring reasoning, task success, and policy safety
- Prototype an agentic system with tool use and multi-step reasoning, integrated with RL training
- Document experiments, ablations, and findings for research and productionization