Skip to main content
Back to jobs

Solutions Architect, Data Center Infrastructure - NVIS

External
NVIDIA logoNvidia · US
Full-timeOn-siteToday
Deep LearningFiberLeadership
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • NVIS Datacenter Engineering and planning: Collaborate with other teams to plan and implement data center infrastructure solutions based on NVIDIA Datacenter reference architecture, including power distribution, cooling systems, network architecture, server hardware, and storage systems.
  • Hardware deployment: Plan and manage deployment of NVIDIA's pioneering AI infrastructure, including highly complex rack-scale, liquid-cooled compute and networking systems, in a fluid and fast-paced environment.
  • Pre-deployment planning: Review cluster and data center architecture, plan network port mapping and fiber-optic cabling BOM, identify risks, train and enable vendors, and find areas for improvement.
  • Design evaluation: Evaluate customer and partner infrastructure design proposals for consistency with industry standards and regulatory requirements; provide feedback and recommendations to improve performance, scalability, and cost-effectiveness.
  • Hardware & remediation: Lead physical troubleshooting cabling debug, mis-wire correction, power and leak issues and drive clusters from initial bring-up to full operational acceptance, partnering with validation, network, and TAM teams to close out remaining compute-tray and networking issues.
  • Vendor & partner management: Oversee smarthands, integration, cabling, and OEM partners-manage dispatches, validate work quality, enforce SLAs, and hold partners accountable-scaling deployment and remediation capacity through partners.
  • Perform testing, troubleshooting and validation of compute systems based on collaboration with product and engineering teams.
  • Quality Assurance: Establish and enforce quality assurance processes to verify that deployments meet established specifications and performance benchmarks. Conduct thorough bring-up, testing, and validation to validate the functionality and reliability of infrastructure components.
  • Teamwork & communication: Collaborate across internal teams, external vendors, and customers to enable seamless integration of data center infrastructure solutions; serve as a domain expert and point of contact for infrastructure and remediation inquiries and blocking issues.
  • What we need to see:
  • Bachelor's degree (or equivalent experience) in Engineering, Computer Science, Information Technology, or a related field.
  • Minimum 3+ years of overall experience in enterprise and/or hyperscale data centers with continual infrastructure deployment experience, preferably for high density AI/HPC data centers.
  • Hands-on experience with hardware break/fix and RMA workflows (triage, swap, ship, track, close) and spares/logistics coordination across multiple sites.
  • Experience building processes and playbooks from scratch, a bias toward action and comfort operating with minimal supervision in ambiguous, fast-paced environments.
  • Demonstrated technical and project leadership under fluid situations, ability to adapt to unknowns and change.
  • Excellent analytical, problem-solving, and decision-making skills, k

Additional Information

NVIDIA is seeking a Solutions Architect in Data Center Infrastructure to join our Infrastructure Specialists (NVIS) team. Academic and commercial groups worldwide are using NVIDIA products to redefine deep learning, data analytics, and power data centers. Join the team building many of the world's largest and fastest AI Factories and supercomputers. This role leads the infrastructure planning, physical deployment, and hardware remediation that drives every cluster to full operational acceptance, spanning power and cooling systems, cabling and network bring-up and validation, and RMA. As the NVIS Solutions Architect for Datacenter Infrastructure, you will focus on data center audit, planning and deployment ensuring the integrity of NVIDIA platform infrastructure. Your primary goal will be to guarantee that all aspects of the data center's physical infrastructure are meticulously planned, implemented, and validated to meet NVIDIA reference architectures, operational requirements, and industry standards. This infrastructure includes architectural systems, power distribution, liquid/air cooling systems, compute, network and cabling (fiber and copper), and telemetry systems.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at NVIDIA? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect