Skip to main content
Back to jobs

Software Development Manager, Infrastructure Reliability

External
Amazon.com Services LLC logoAmazon.com · Nashville, TN
Full-timeOn-site4d ago
LeadershipMentoringObservabilityRobotics
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

If you are passionate about developing robust, highly available, scalable agentic and automated systems at tremendous scale, this is an excellent opportunity for you. Apply today to join our talented team!

Requirements

  • 3+ years of engineering team management experience
  • 7+ years of working directly within engineering teams experience
  • 3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
  • 8+ years of leading the definition and development of multi tier web services experience
  • Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
  • Experience partnering with product or program management teams
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field
  • Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
  • Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers
  • Master's degree in computer science, engineering, mathematics or equivalent
  • Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
  • Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or a

Additional Information

Are you driven by innovation and complex problem-solving? At Infrastructure Reliability Engineering, we build scalable solutions that ensure the reliability of Amazon's critical systems. Our team develops and operates tools that detect and prevent outages to maintain high availability across Amazon's global infrastructure. Join us to architect solutions that directly impact millions of customers, with the resources and support to make meaningful contributions. The team at Amazon is responsible for building intelligent and real-time insights into service-to-service communications, network traffic, and event correlation across hundreds of Amazon's critical fulfillment and robotics services. Our solutions support visibility into anomalous service behavior to prevent and quickly recover from incidents, ensuring high availability to keep the Customer Promise. We are seeking talented Software Development Manager to invent the next generation of agentic observability solutions at Amazon scale. As part of this dynamic and forward-thinking team, you'll have the opportunity to lead greenfield programs while collaborating with leaders and customers across Amazon. We foster a culture that encourages personal and professional growth, empowering our team members to continually expand their skills and knowledge. If you are passionate about leading a high performance team, whose tech stack includes ML, back end, front end, and data engineering, then this role is for you! Key job responsibilities - Building, leading and growing diverse, inclusive and high-performing engineering product teams, including recruiting, mentoring, motivating, promoting and performance management of engineers and scientists. - Partner effectively with cross-functional teams including customers, product managers, stakeholders and leaders to ensure the successful delivery of product capabilities - Timely execution and prioritization of goals for the team and effectively mitigating risks. - Strategic leadership to create product roadmaps, set the vision for the team and collaboratively partner with other stakeholder teams. - Technical leadership including architecture and design discussions for science and engineering problems and technical deep dives to ensure resolution of root causes of complex issues. - Manage the operational support of products at scale to ensure prevention and timely resolution of customer impactful problems. A day in the life Amazon offers a full range of benefits that support you and eligible family members, including domestic partners. Benefits can vary by location, the number of regularly scheduled hours you work, length of employment, and job status such as seasonal or temporary employment. The benefits that generally apply to regular, full-time employees include: 1. Medical, Dental, and Vision Coverage 2. Maternity and Parental Leave Options 3. Paid Time Off (PTO) 4. 401(k) Plan If you are not sure that every qualification on the list above describes you exactly, we'd still love to hear from you! At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you're passionate about this role and want to make an impact on a global scale, please apply!


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Amazon.com Services LLC? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect