Cloud Systems Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Benefits
Additional Information
SOLV Energy is a leading provider of infrastructure services to the power industry, designing, building and maintaining utility scale solar, battery storage and high voltage substation projects nationwide. Job Description Summary: The Cloud Systems Engineer will will manage, maintain, and support SOLV Energy's Azure and AWS cloud-based infrastructure to ensure consistent, reliable, and secure operations, while maximizing the value of subscribed systems and services. This is a hands-on technologist role, reporting to the IT Infrastructure and Cloud Manager and will have responsibility for all aspects of the company's cloud-based computing infrastructure and services. This role is hybrid, with regular in-office presence in San Diego, CA. Specific location details and expectations will be discussed during the interview process. Job Description: *This job description reflects management's assignment of essential functions; it does not prescribe or restrict the tasks that may be assigned Position Responsibilities and Duties: Assess current Azure and AWS instance and create continuous improvement roadmap. Define cloud systems strategy to maximize the return on IT investments, while meeting or exceeding uptime and performance expectations. Drive the adoption of best practices in cloud architecture and operations, ensuring high standards of performance and security. Design, plan, and implement Azure and AWS based systems and services in support of functional, storage, compute, data integration, and systems security initiatives. Collaborate with S e cOps and IT Operations team s to ensure that new or updated solutions and services comply with the enterprise cyber security standards. Proactively identify and apply system updates to prevent issues, strengthen security, tune performance, automate tasks, and manage costs. Monitor Azure and AWS systems operations and address alerts, anomalies, and issues . Develop and support D isaster Recovery , Backup, and retention policies on Azure and AWS platforms . Maintain zero trust endpoint security with tools such as Microsoft Defender Develop and implement training programs for team members to enhance their skills and knowledge in Azure technologies Lead project planning and execution, ensuring timely delivery of cloud solutions and adherence to project timelines Manage cloud networking and security and monitor the logging of systems. Development and maintenance of IT policies and procedures. Ensure compliance with IT General Controls, provide needed information in support of audits and to substantiate process and controls compliance. Own and maintain enterprise monitoring and alerting platforms, including Zabbix and cloud‑native tools, to provide clear visibility into the health, performance, capacity, and availability of Azure and AWS environments. Build and support automation workflows using scripting and orchestration tools such as Ansible/AWX and operational runbooks to reduce manual effort, improve reliability, and streamline day‑to‑day operations. Identify and adopt practical AI‑assisted features within monitoring and automation tools to improve anomaly detection, alert quality, and operational insights, while ensuring decisions and remediation remain under engineering control. Minimum Skills or Experience Requirements: Bachelor's degree in Information Technology , related technology field, or equivalent combination of education and experience 10+ years overall IT infrastructure experience, including 5+ years building and operating production workloads in Azure or AWS at enterprise scale Expertise in Azure or AWS cloud system s oversight, performance tuning, and administration. Linux Operating System Experience / Knowledge Strong knowledge of cloud security technologies and best practices Expert ise with Infrastructure as Code: design and security, configuration management, integration, deployment, performance monitoring and tuning, automation of infrastructure. Expertise with ARM templates and Terraform to enable automation. Expertise with Entra ID Administration Expertise with networking and networking protocols & services. Proficiency in scripting languages such as PowerShell, Python, and Bash for automation Expertise with deployment techniques (and tools) in a distributed environment. Strong oral and written communication skills with a high degree of comfort with varying types of audiences Emotional intelligence, flexible work style, and excellent diplomatic skills across all levels of an organization Expertise designing and delivering complex solutions on time and with expected quality. Expertise supporting various compliance and regulatory frameworks. Advanced skills in performance tuning and optimization of cloud resources Ability to multi-task, establish priorities, work independently, man a ge time, and deliver on commitments. Hands‑on experience operating enterprise monitoring and alerting platforms Practical experience