Skip to main content
Back to jobs

Senior Software Development Engineer, EC2 Trainium AI Infra

External
Full-timeOn-site1mo ago30+ days old, may be filled
AWS
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

The EC2 UltraServer Provisioning team is a high-performing engineering organization responsible for delivering AWS Trainium-based UltraServers infrastructure at scale. We manage end-to-end provisioning workflows from host ingestion through testing, repair, and recovery.

Requirements

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team
  • 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent
  • Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
  • USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually

Additional Information

The Software Development Engineer will lead the team in technical strategy, design, build, and operation of infrastructure services including provisioning and availability of AWS Trainium-based AI servers. This role requires expertise in architecting large-scale systems, building micro services, and cross-functional collaboration with several other teams such as capacity management, hardware engineering, and datacenter teams to manage AI/ML infrastructure. Key job responsibilities - Design and develop innovative technologies that power the infrastructure supporting AI workloads on Ultraservers - Lead technical projects establishing EC2 as the pioneer in cloud computing for AI/ML workloads across diverse applications including LLMs, multimodal systems, and emerging model architectures. - Collaborate with various teams to influence architecture of provisioning systems and improve to operate at scale and efficiently. - Build customer relationships by investigating complex performance challenges, developing solutions, and publishing actionable best practices through multiple channels.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Amazon Development Center U.S., Inc.? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect