Software Development Engineer II, Intelligent Cloud Hosting (ICON)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
The ICON team hosts Amazon websites and backend platform services that power the customer experience. We abstract and centralize the management from service teams, bring agility to business processes, and operate 100s of Tier-1 services behalf of our internal customers. ICON is an early adopter of Amazon's product offerings and automates the management and operational activities on behalf of developers to strengthen services' posture towards security, availability, and efficiency so that service developers can innovate on behalf of Amazon's end customers. The ICON team is located within the Intelligent Control (ICC) organization who owns connecting our worldwide websites and other consumer experiences such as Kindle, Amazon Video, and Alexa to the internet, as well as ensuring the highest level of availability, security and privacy of the web services that power the experience we deliver to our customers worldwide.
Requirements
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
- The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualif
Additional Information
Amazon's Intelligent Cloud Hosting (ICON) team is looking for a Software Development Engineer (SDE) to join our team. ICON is responsible for the reliability and operational excellence of Amazon's cloud hosting infrastructure, supporting all of Amazon's global marketplaces, partner portals, and consumer experiences including Kindle, Alexa, Amazon Video, and the Mobile Application. The team builds intelligent systems that proactively detect, diagnose, and resolve incidents across hundreds of thousands of services powering one of the world's largest distributed architectures. The challenges SDEs solve on this team are high-impact and mission-critical. The team is building AI-powered incident response systems that automatically investigate production issues, identify root causes from metrics, logs, and deployment events, and recommend mitigations to on-call engineers. These systems operate at massive scale, processing thousands of signals per investigation and reducing mean-time-to-resolution for critical production incidents. As an SDE II on the team, you will: * Design and build production generative AI workflow that automate incident investigation workflows, from alert ingestion through root-cause analysis to mitigation recommendations * Work on tier-1, multi-tenant, high-performance systems built on AWS services (Step Functions, Bedrock, DynamoDB, Athena) with technical challenges unique to this kind of scale and throughput * Build developer productivity and operational tooling including orchestration, predictive analytics, automated diagnosis, and self-healing systems * The team is looking for engineers who are passionate about applying generative AI and machine learning to operational problems, thrive in ambiguous environments, and want to build systems that keep Amazon's infrastructure running for millions of customers worldwide. Key job responsibilities * Design and build distributed systems and automation in a large-scale cloud environment that supports millions of customers globally * Develop scalable services and tools on AWS that process high volumes of operational data to drive better decision-making * Solve broadly defined problems from design to delivery, balancing speed with long-term technical quality * Collaborate with engineers, scientists, and product managers to scope projects and ensure deliverables meet a high quality bar * Evaluate and apply emerging technologies, including generative AI and machine learning, to solve real-world operational challenges * Work in an agile environment delivering high-quality software with a strong focus on operational excellence, security, and availability
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Amazon.com Services LLC? Share your experience