Skip to main content
Back to jobs

Staff Machine Learning Engineer, AI Serving

External
Reddit logoReddit · Remote
$253K–$355K/yrFull-timeRemote1mo ago
AWSCachingGenerative AIKubernetesLLMsMachine Learning
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure that powers recommendations, content discovery, user and content quantification, while directly impacting other teams such as Growth, Ads, Feeds, and Core Machine Learning teams.

Responsibilities

  • As a Staff Machine Learning Engineer, you will lead the development of a large-scale ML Inference Platform at Reddit.
  • Lead the end-to-end design, implementation, and maintenance of a highly available, low-latency GPU-based model serving system for search, ranking, and LLMs supporting Millions of QPS.
  • Design and develop ML and Generative AI systems in cloud-based production environments on Kubernetes at scale.
  • Rapidly develop prototypes and develop a high-performance feature hydration and processing system as a part of the inference stack - including routing, caching, and batching.
  • Lead a unified GPU model export framework to support converting trained models into optimized GPU inference models.
  • Strong understanding of real-time ML observability to track feature/model performance.
  • Experience working with LLM serving online at scale.
  • Built an E2E inference performance benchmarking framework
  • Deep Understanding of multi-cluster compute environment and network topology that is specific to ML inference use cases.
  • Who You Might Be:
  • 7+ years of experience in ML Engineering, AI Platform Engineering, or Cloud AI Deployment roles.
  • Have experience operating orchestration systems such as Kubernetes at scale
  • Deep experience with cloud-based technologies for supporting an ML platform, including tools like AWS, Google Cloud Storage, infrastructure-as-code (Terraform), and more
  • Proficiency with the common programming languages and frameworks of ML, such as Go, Python, etc.
  • Excellent communication skills with the ability to articulate technical AI concepts to non-technical stakeholders
  • Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the genAI product development lifecycle.
  • Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems is a plus
  • Strong proficiency in Python and deep experience with modern AI/ML frameworks (Triton, Dynamo, vLLM, Pytorch)

Benefits

Comprehensive Healthcare Benefits and Income Replacement Programs401k with Employer MatchGlobal Benefit programs that fit your lifestyle, from workspace to professional development to caregiving supportFamily Planning SupportGender-Affirming CareMental Health & Coaching BenefitsFlexible Vacation & Paid Volunteer Time OffGenerous Paid Parental LeavePay Transparency:This job posting may span more than one career level.The base salary range for this position is:$253,300 - $354,600 USDIn select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.Health insuranceDental insuranceVision insurance401(k)Paid time offFlexible scheduleEquity / stock optionsParental leave

Additional Information

Reddit is a community of communities. It's built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information. For more information, visit www.redditinc.com .


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Reddit? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect