Staff Systems Test Operations Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
Help shape how AI hardware reaches production at scale. As a Staff Hardware System Test Engineer, you will build manufacturing test solutions for advanced AI systems. Your work will help ensure every product meets the highest quality standards before deployment. You will develop test software, create manufacturing test content, and improve test infrastructure across server modules, blades, and rack scale systems. You will work closely with hardware, firmware, and software engineers to solve complex technical challenges. This is an opportunity to influence manufacturing test strategy, improve production quality through data, and raise engineering standards across multiple teams. The team and culture You'll join the Product Test and Diagnosis team within Manufacturing Operations. The team owns test strategy across the product lifecycle, from manufacturing through to deployed systems. Day to day, you'll work across hardware, firmware, software, quality, and manufacturing engineering. Decisions are driven by technical evidence, manufacturing data, and a shared focus on product quality. You'll have the freedom to own technical problems, contribute to engineering direction, and support colleagues through design reviews, debugging, and technical guidance.
Requirements
- Essential
- Experience developing manufacturing test solutions for high-performance servers, accelerators, or other large-scale compute hardware.
- Strong technical leadership as an individual contributor, driving complex technical work across multiple teams.
- Deep understanding of silicon, board, and system design, with experience defining test coverage and leading complex root cause investigations.
- Hands-on experience with Linux, OpenBMC, hardware diagnostics, and scripting or automation using Python, Bash, or similar.
- Proven ability to collaborate with hardware, firmware, software, and platform teams to improve testability and manufacturing quality.
- Desirable
- Experience with rack-scale or multi-node systems, high-speed interconnects, and production qualification of complex hardware.
- Experience improving manufacturing performance through data-driven optimisation of yield, test time, and false fail rates.
- Knowledge of open-source tools for manufacturing test, system bring-up, and hardware diagnostics.
- Experience designing testability and diagnosability features, such as telemetry, self-test hooks, or debug interfaces, in partnership with design teams.
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Graphcore? Share your experience