Software Development Engineer, EC2 Trainium AI Infra
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
The EC2 UltraServer Provisioning team is a high-performing engineering organization responsible for delivering AWS Trainium-based UltraServers infrastructure at scale. We manage end-to-end provisioning workflows from host ingestion through testing, repair, and recovery.
Requirements
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 1+ years of software development engineer or related occupational experience
- 1+ years of designing and developing large-scale, multi-tiered, multi-threaded, embedded or distributed software applications, tools, systems, and services using: C#, C++, Java, or Perl experience
- 1+ years of Object Oriented Design experience
- Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field
- Experience programming with at least one software programming language
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
- USA, WA, Seattle - 143,700.00 - 194,400.00 USD annually
Additional Information
EC2 Infrastructure Services organization is responsible for making EC2 instances available to our customers at all times. We are a key part of what makes EC2 elastic. AI infrastructure has taken a key place in EC2 and we are building systems, services, and automation to operate this at scale. The Software Development Engineer will design, build, and maintain cloud-based provisioning and recovery systems for AWS Trainium-based AI UltraServers. This role requires expertise in AWS services, system architecture, and cross-functional collaboration with Capacity Management, Hardware Engineering, and Datacenter Operations to manage AI/ML infrastructure. Key job responsibilities Key job responsibilities - The Software Development Engineer is responsible for building and maintaining scalable micro services. - They are adept at system design that solves the business problem efficiently. - Work in environments where the technology strategy is defined but the solution design is not - Build cloud-based solutions using AWS native services for scaling infrastructure frameworks - Create observable systems with appropriate metrics and alarming - Collaborate with customers and stakeholders to convert business needs into technical designs - Participate in code reviews and technical assessments
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Amazon.com Services LLC? Share your experience