US Infrastructure & Operations Technical Lead

Radiant · East Coast
Full-time · On-site · 1mo ago
Documentation · Leadership · Linux · Machine Learning · Move · Site Reliability Engineering

Responsibilities

Leadership & Operational Ownership

  • Lead a small but high-impact US Infrastructure Operations team, owning both people leadership and technical execution
  • Ensure 99.9%+ platform uptime across US-region services
  • Act as the senior US operational owner for production infrastructure, accountable for reliability, incident outcomes, and day-to-day operational execution
  • Partner tightly with the UK Infrastructure Operations Manager to align priorities, respond to incidents, and execute global infrastructure plans in real time
  • Own US-side incident leadership, driving fast and effective resolution of production-impacting infrastructure issues
  • Build and reinforce a strong ownership culture based on "do, document, automate"
  • Ensure operational knowledge is captured and shared through lightweight, high-signal documentation rather than process overhead
  • Hire, onboard, and develop Infrastructure Operations engineers as the team scales
  • Run direct 1:1s and performance conversations focused on raising technical bar and operational effectiveness
  • Ensure disciplined execution of core operational processes (incident, change, problem management) without slowing delivery
  • Participate in on-call rotation and lead from the front during major incidents
  • Willingness to travel within the US and Europe as required to support infrastructure deployments, data centre work, and

Benefits

We move quickly, solve meaningful infrastructure challenges, and provide engineers with the opportunity to influence how next-generation AI infrastructure is designed, operated, and scaled globally.

You can also expect:

  • Exposure to industry-leading GPU and AI infrastructure
  • Opportunities to help build and scale a growing US operations function
  • A collaborative, inclusive, and globally connected engineering culture
  • Real ownership and influence across operational strategy and execution
  • Work at the intersection of reliability, automation, performance, and scale
  • A flexible remote-first working environment with ambitious growth plans
  • Remote work options
  • Flexible schedule

Additional Information

Role Overview

We are seeking a US Infrastructure Operations Technical Lead to drive the operational excellence, technical leadership, and growth of Radiant's US Infrastructure Operations function. This is a hands-on player-manager role designed for an infrastructure-focused engineering leader with a strong Site Reliability Engineering mindset and deep understanding of large-scale distributed infrastructure environments.

Working closely with the UK Infrastructure Operations Manager during overlapping morning hours (US Eastern Time), you will help coordinate cross-regional operations, strategic planning, incident management, and infrastructure delivery across Radiant's global AI and HPC platform. During US business hours, you will lead and mentor the local Infrastructure Operations team, currently consisting of three engineers, while helping scale operational maturity and team capability as the business continues to grow.

The ideal candidate will come from a hyperscale, HPC, or large-scale cloud-native SaaS infrastructure background, with experience operating complex distributed systems at scale. This role requires breadth across datacentre compute, Linux systems, networking, and storage fabrics, with the ability to troubleshoot and lead continuous improvement of our infrastructure. You should be comfortable operating and troubleshooting bare-metal environments, low-latency networking, storage protocols, and core infrastructure technologies underpinning high-performance AI and GPU compute platforms.

This role requires strong operational leadership capabilities, including experience running small engineering teams, participating in ITIL-aligned operational processes, and supporting high-availability production environments through structured incident, change, and problem management practices. You will also participate in an on-call rota to lead major incidents, orchestrating technical resources to quickly resolve large-scale issues.

As Radiant expands its global footprint, your operational leadership, technical expertise, and ability to build high-performing teams will play a critical role in shaping the future of our US infrastructure operations.

