AI Computing Development Engineer, TensorRT and TensorRT-LLM

External

Nvidia · Shanghai, China

Full-timeOn-site1d ago

PythonMachine LearningPyTorchComputer VisioniOS

Prepare for this interview

AI-generated questions, company research, and talking points tailored to this role

Responsibilities

Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
Perform performance analysis, optimization, and tuning of deep learning inference workloads
Track and integrate academic and industry advancements in AI and feature-update TensorRT/TensorRT-LLM accordingly