Systems Engineer
About the role
We are building next-generation AI infrastructure from the ground up. Our mission is to deliver highly performant, reliable, and scalable infrastructure purpose-built for large-scale AI training and inference. As a startup, we operate with urgency, ownership, and a bias toward action. We are assembling the foundational infrastructure that will power frontier AI workloads, and we're looking for engineers who want to build it from zero to scale.
We are hiring a Senior Systems Engineer to support the deployment and bringup of infrastructure across our data center environments. You will help execute node, rack, and system deployments, ensuring infrastructure is validated, performant, and production-ready.
This role is deeply technical and execution-focused. You will work hands-on in the details: deploying infrastructure, validating firmware and system configurations, troubleshooting performance issues, and helping us build repeatable processes as we scale.
Responsibilities
- Infrastructure Deployment & Bringup
  - Execute end-to-end bringup of infrastructure nodes and racks from installation to production readiness
  - Validate BIOS, BMC, and firmware configurations
  - Perform rack-level integration including power, cabling, and airflow validation
  - Support deployment and validation of high-performance infrastructure environments
- System & Performance Validation
  - Run infrastructure burn-in and validation testing
  - Validate node-to-node system performance across distributed environments
  - Troubleshoot hardware, firmware, and infrastructure-level issues
  - Assist with system reliability and performance optimization
- Automation & Process
  - Contribute to automation for provisioning and infrastructure validation
  - Improve deployment playbooks and operational documentation
  - Identify reliability issues early and drive corrective actions
  - Help turn manual deployment processes into repeatable systems
- Cross-Functional Collaboration
  - Work closely with infrastructure, networking, systems software, and data center teams
  - Coordinate with hardware vendors to resolve deployment or system issues
  - Support rapid infrastructure expansion as capacity grows
Requirements
- Required
  - 5-8+ years in infrastructure engineering, hardware deployment, or data center operations
  - Hands-on experience deploying server infrastructure in data center environments
  - Strong Linux systems knowledge
  - Experience troubleshooting distributed systems or infrastructure performance issues
  - Comfortable working onsite in data center environments as needed
- Strongly Preferred
  - Experience in AI/ML infrastructure or HPC environments
  - Familiarity with high-performance computing or distributed systems
  - Automation experience (Python, Ansible, Terraform, Bash)
  - Experience working in high-density data center environments
What Success Looks Like
- Infrastructure systems are brought online quickly and correctly
- Performance baselines meet or exceed expectations
- Deployment processes become faster and more reliable over time
- You help build the foundation for scaled infrastructure growth
For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice.