Data Center Power Test Architect
Responsibilities
- Own Data Center Power Quality Metrics: Define quality KPIs such as code coverage, system uptime, bug escape rate, and validation completeness. Establish dashboards and reporting mechanisms to track progress and drive data-driven decision-making.
- Drive Root Cause Analysis and Debugging: Lead complex issue investigations that span firmware, software, and hardware layers. Develop and document debug methodologies and tools to improve diagnosis efficiency across the team.
- Innovate in Lab Automation and CI/CD: Partner with DevOps and infrastructure teams to enhance test automation pipelines, integrate continuous testing into nightly and pre-merge workflows, and ensure fast and reliable release qualification.
- Enable Productization and Customer Readiness: Validate real-world use cases, customer configurations, and production scenarios. Contribute to release gates and sign-off criteria to ensure firmware is ready for deployment in mission-critical systems.
- Boost Team Efficiency with AI: Demonstrate proven experience using AI-powered tools and copilots to accelerate test development, automate repetitive validation workflows, and streamline debug and root cause analysis.
What we need to see
- B.S., M.S., or Ph.D. in Electrical Engineering, Computer Engineering, Computer Science, or a related field (or equivalent experience).
- 10+ years of experience in software/firmware testing for data center power enablement, with a focus on telemetry and power efficiency across systems.
- Strong knowledge of system architecture, power shelf, baseboard management, hardware and software power features, industry power standards, system interfaces, and embedded controllers.
- Proven experience designing test frameworks and infrastructure in Python, C/C++, or similar languages.
- Expertise with platform standards for security, telemetry, and manageability (NIST, DMTF, OCP).
- Hands-on experience with server platform, network, storage, and cluster configuration and debugging.
- Background in platform telemetry and data center node lifecycle management and support, including CPU/GPU workloads. Proficiency in scripting languages such as Python.
- Expertise in administering, operating, and configuring Kubernetes and Envoy. Validated expe
Additional Information
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world!
We are seeking a highly skilled and hard-working Senior Test Architect to join our multifaceted Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimization, and testing of our flagship supercomputers and data center offerings. If you are a dedicated engineer with a deep understanding of data center power systems, and you thrive in an exciting, innovative environment, this could be the perfect role for you!