AI Computing Software Development Engineer, LLM Inference

NVIDIA · Shanghai, China
Full-time · On-site · 4d ago
Machine Learning · TensorFlow · PyTorch · iOS

Responsibilities

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
  • Performance analysis, optimization and tuning
  • Closely follow academic developments in the field of artificial intelligence and large language models
  • Provide feedback on architecture and hardware design and development
  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams
  • Publish key results in scientific conferences

What we need to see

  • Master's or higher degree in Computer Engineering, Computer Science, Applied Mathematics or a related computing-focused field (or equivalent experience)
  • 2+ years of relevant software development experience.
  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
  • Strong curiosity about artificial intelligence and awareness of the latest developments in deep learning, such as LLMs and generative and recommender models
  • Experience working with deep learning frameworks like TensorFlow and PyTorch
  • Proactive and able to work without supervision
  • Excellent written and oral communication skills in English

Additional Information

We are now looking for a Software Development Engineer to help with the TensorRT LLM and TensorRT Edge LLM projects! NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLMs, ChatGPT and generative AI that have put deep learning at the "iPhone moment" for AI. Join the team that is building the inferencing software that will be used across our product lines! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.


Interested in this role?

Apply on the company's website.
