Senior Software Development Engineer, EC2 Trainium AI Infra
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
The EC2 UltraServer Provisioning team is a high-performing engineering organization responsible for delivering AWS Trainium-based UltraServers infrastructure at scale. We manage end-to-end provisioning workflows from host ingestion through testing, repair, and recovery.
Requirements
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
- USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually
Additional Information
The Software Development Engineer will lead the team in technical strategy, design, build, and operation of infrastructure services including provisioning and availability of AWS Trainium-based AI servers. This role requires expertise in architecting large-scale systems, building micro services, and cross-functional collaboration with several other teams such as capacity management, hardware engineering, and datacenter teams to manage AI/ML infrastructure. Key job responsibilities - Design and develop innovative technologies that power the infrastructure supporting AI workloads on Ultraservers - Lead technical projects establishing EC2 as the pioneer in cloud computing for AI/ML workloads across diverse applications including LLMs, multimodal systems, and emerging model architectures. - Collaborate with various teams to influence architecture of provisioning systems and improve to operate at scale and efficiently. - Build customer relationships by investigating complex performance challenges, developing solutions, and publishing actionable best practices through multiple channels.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Amazon Development Center U.S., Inc.? Share your experience