Skip to main content
Back to jobs

Senior Backend Engineer, ML Inference Systems

External
Unity logoUnity · Mountain View, CA
Full-timeOn-site1mo ago
CI/CDDockerGCPGrafanaKubernetesMachine Learning
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Every day, we connect billions of players with the games and experiences they love. Our Vector Gamer AI team sits at the heart of that mission, governing ad ranking and bidding decisions across billions of daily impressions, where large-scale machine learning and real-world impact converge at scale. We're hiring a Senior Backend Engineer to build and operate the infrastructure those models depend on. You'll design and operate the distributed systems that power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems. Join us and help influence how billions of gaming experiences are discovered, monetized, and how creators are rewarded.

Responsibilities

  • Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests
  • Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure
  • Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput
  • Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana
  • Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment
  • Promote and implement best practices for backend service development, testing, deployment, and monitoring (DevOps, SRE)

Requirements

  • Experience designing, deploying, and maintaining distributed systems at scale
  • Expertise in Golang for building high-performance, low-latency backend infrastructure
  • Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes
  • Strong grounding in monitoring and observability tooling, including Prometheus and Grafana
  • Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains
  • Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices
  • Familiarity with machine learning platforms, workflows, and serving infrastructure
  • You might also have
  • Experience with ML inference servers like NVIDIA Triton Inference Server
  • Familiarity with auction mechanics or bidding systems in an ad tech context
  • Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security
  • Additional information
  • Relocation support is not available for this position

Benefits

At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.Life at UnityThis position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners locateHealth insurancePaid time offEquity / stock options

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Unity? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect