Master degree or Ph.D. in in Computer Science, Artificial Intelligence, Software Engineering, or related fields.
Good programming skills of modern C++ and Python
Know foundations of Deep Learning theory and some hands-on experience
Communicate in English fluently (both written and spoken)
Are passionate on solving problems and positive thinker
Experience of performance optimization or high efficiency kernel development experience is a plus
Experience of PyTorch, SGLang, vLLM is a plus
Experience of LLM and deep understanding of model structure is a plus
Job Type:
College Grad
Shift:
Shift 1 (China)
Primary Location:
PRC, Shanghai
Additional Locations:
Business group:
Posting Statement:
Position of Trust
N/A
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.
*
Additional Information
Job Details:
Job Description:
Conducts design and development to build and optimize AI software. Designs, develops, and optimizes for AI frameworks (e.g., SGLang) and contribute to public upstream. Implements various distributed algorithms such as model/data parallel frameworks, parameter servers, dataflow based asynchronous data communication in machine learning, and/or deep learning frameworks. Transforms computational graph representation of neural network model, and develops machine learning and/or deep learning primitives in mathematical libraries. Profiles distributed deep learning models to identify performance bottlenecks and proposes solutions across individual component teams. Optimizes code for various computing hardware backends, and interacts with machine learning and/or deep learning researchers, and utilizing experience with machine learning and/or deep learning frameworks.