Skip to main content
Back to jobs

Principal Software Engineer - ML Platform Engineer

External
Riot Games logoRiot Games · Singapore
Full-timeOn-siteToday
RailsCI/CD
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Design, implement, and evolve internal platform capabilities that make AI Efficiency services easier to build, ship, observe, secure, and operate
  • Build and maintain self-service workflows, reusable platform abstractions, and golden paths that improve developer productivity while preserving reliability, security, and governance
  • Improve platform reliability through better monitoring, alerting, observability, deployment safety, release practices, and incident readiness
  • Define and operationalize service health indicators, SLIs, SLOs, and related reliability metrics that help teams make informed tradeoffs between reliability, velocity, and cost
  • Build automation that reduces operational toil and improves mean time to detect, respond, and recover from incidents
  • Partner with engineers throughout the software development lifecycle to embed operability, production readiness, and maintainability into system design, implementation, rollout, and ongoing support
  • Improve CI/CD systems, developer workflows, and release pipelines so shipping becomes safer, faster, and more repeatable
  • Identify platform and reliability risks across distributed systems, infrastructure, service dependencies, and operational workflows, and drive durable improvements
  • Troubleshoot AI model-serving issues across frameworks, runtimes, and hardware environments, including diagnosing configuration, compatibility, and performance issues across different GPU platforms and supporting model format conversion workflows when needed
  • Design and run resilience, recovery, and failure-mode testing to validate system behavior under stress and uncover hidden weaknesses before they impact users
  • Evaluate, integrate, and operate AI-assisted engineering tools that improve code quality, reliability, security, performance, and developer productivity across the software delivery lifecycle
  • Build and evolve automation pipelines that combine conventional CI/CD systems with agentic workflows such as automated code review, bug detection, regression analysis, tes

Additional Information

Riot Games was established in 2006 by entrepreneurial gamers who believe that player-focused game development can result in great games. In 2009, Riot released its debut title League of Legends to critical and player acclaim. As the most played PC game in the world, over 100 million play every month. Players form the foundation of our community and it's for them that we continue to evolve and improve the League of Legends experience. We're looking for humble but ambitious, razor-sharp professionals who can teach us a thing or two. We promise to return the favor. Like us, you take play seriously; you're passionate about games. We embrace those who see things differently, aren't afraid to experiment, and who have a healthy disregard for constraints. That's where you come in. The AI Efficiency team at Riot Games builds the platforms, tools, and technical foundations that help Rioters safely and effectively use AI to accelerate how we work. As these systems become increasingly important to creative, product, and development workflows across Riot, we need platform engineering that can keep pace with growing scale, complexity, and expectations. As a Principal Platform Engineer on the AI Efficiency team, you will design, build, and evolve the internal platforms, automation systems, and operational guardrails that make our AI services and developer workflows more scalable, reliable, and easy to use. You will partner closely with software engineers, infrastructure teams, and cross-functional stakeholders to improve developer experience, platform reliability, deployment safety, observability, and operational excellence across a growing portfolio of AI services, internal tooling, and supporting infrastructure. You will also help the team evaluate and operationalize a new generation of AI-native engineering workflows, including agent-assisted code review, automated bug triage and remediation, AI-driven performance and security analysis, browser-based UI validation, and other emerging automation patterns that can safely augment human judgment. This role is not only about building better platforms for today, but also about shaping how Riot adopts the next wave of intelligent engineering tooling responsibly and effectively. Current platform-engineering guidance emphasizes product-minded internal platforms, self-service, and golden paths, while modern browser automation tooling now explicitly supports AI-agent workflows and accessibility-focused testing. You're right for this role if you enjoy making complex systems easier to use and operate, reducing cognitive load for engineers, building paved roads instead of one-off solutions, and improving reliability through strong platform design. You are energized not only by hard infrastructure and operational problems, but also by the opportunity to responsibly bring new AI-native automation patterns into real engineering workflows.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Riot Games? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect