Machine Learning Research Engineer Co-op

External

Red Hat · Boston

ContractRemoteToday

KubernetesLinuxLLMsMachine LearningPython

Prepare for this interview

AI-generated questions, company research, and talking points tailored to this role

Responsibilities

Research via experimentation and theoretical modeling the network bandwidth requirements and trade-offs in Prefill-Decode (P/D) disaggregated LLM serving.
Research and implement networking techniques/methods for high-performance KV cache transfers in deployment setups without RDMA networking.
Conduct experiments to evaluate the impact of newly developed non-RDMA KV Cache transfer techniques on performance (latency and throughput) in P/D LLM serving.