Infrastructure Operations Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Manage and support production and non-production infrastructure running in AWS
- Administer, monitor, and troubleshoot Linux-based systems and services
- Support MySQL environments, including performance tuning, troubleshooting, maintenance, and operational health
- Participate in an on-call rotation and respond to infrastructure and service-related incidents
- Improve system reliability, availability, and performance through automation and operational best practices
- Partner with engineering teams to deploy, maintain, and optimize infrastructure services
- Troubleshoot issues across compute, storage, networking, operating systems, and databases
- Document processes, operational procedures, and runbooks to improve team effectiveness
- Contribute to continuous improvement efforts around monitoring, alerting, patching, and capacity planning
- Required qualifications
- 3+ years of experience in infrastructure engineering, systems engineering, site reliability engineering, or a related role
- Strong hands-on experience with AWS services and cloud-based infrastructure
- Solid experience administering and troubleshooting MySQL in production environments
- Strong Linux systems administration skills
- Experience supporting production services and participating in an on-call rotation
- Experience with either Ansible, Salt, Chef and Terraform for infrastructure provisioning and management
- Familiarity with scripting or automation for operational tasks using Bash and Python
- Strong troubleshooting skills and the ability to work through complex technical issues methodically
- Good communication skills and the ability to work effectively with cross-functional teams
Requirements
- Experience with docker or other container orchestration platforms
- Experience with infrastructure as code and automation tools
- Familiarity with monitoring, logging, and alerting platforms
- Experience working in high-availability or customer-facing production environments
- Ownership mindset and strong operational discipline
- Ability to stay calm and effective during incidents
- Willingness to learn, improve systems, and drive reliability-focused changes
- Practical approach to solving infrastructure and operational problems
- Team player who values clear communication and documentation
- On-call expectations: This position requires participation in a scheduled on-call rotation to support production systems and help ensure service reliability.
- All applications are treated in accordance with the SolarWinds Privacy Notice: https://www.solarwinds.com/applicant-privacy-notice
Benefits
Additional Information
At SolarWinds, we're a people-first company. Our purpose is to enrich the lives of the people we serve-including our employees, customers, shareholders, partners, and communities. Join us in our mission to help customers accelerate business transformation with simple, powerful, and secure solutions. The ideal candidate thrives in an innovative, fast-paced environment and is collaborative, accountable, ready, and empathetic. We're looking for individuals who believe they can accomplish more as a team and create lasting growth for themselves and others. We hire based on attitude, competency, and commitment. Solarians are ready to advance our world-class solutions in a fast-paced environment and accept the challenge to lead with purpose. If you're looking to build your career with an exceptional team, you've come to the right place. Join SolarWinds and grow with us!
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at solarwinds? Share your experience