Data Scientist
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
As a Data Scientist, you will work at the intersection of data science and production systems, deploying advanced statistical models and machine learning methodologies directly within operational environments. You will collaborate closely with engineering and infrastructure teams to optimize GPU utilization, workload scheduling, and system efficiency in real time. You will design experiments, analyze large-scale system telemetry data, and build predictive and optimization solutions that are tightly integrated into production workflows. This role blends hands-on deployment with analytical rigor, turning complex infrastructure data into measurable improvements in performance and cost. You will translate research and modeling insights into scalable, production-ready systems.
Requirements
- MS or PhD in Computer Science, Statistics, Applied Mathematics, Machine Learning, or related quantitative field
- 8+ years (or equivalent experience) applying statistical modeling or machine learning to large-scale datasets
- Strong proficiency in Python and scientific computing libraries (NumPy, pandas, SciPy, scikit-learn, PyTorch or TensorFlow)
- Demonstrated experience designing and analyzing controlled experiments (A/B testing, causal inference, hypothesis testing)
- Experience working with distributed data systems (Spark, Ray, Dask, or similar)
- Proficiency in SQL and working with large-scale structured datasets
- Experience building, deploying, and maintaining predictive models in production environments
- Strong understanding of optimization techniques (linear programming, convex optimization, stochastic optimization, or reinforcement learning)
- Experience working with time-series data and performance telemetry
- Ability to translate analytical insights into production-ready systems and workflows
- Strong collaboration skills and ability to work cross-functionally in fast-paced environments
- Preferred:
- PhD with published research in systems optimization, distributed computing, ML systems, or performance modeling
- Experience with GPU workloads, distributed training, or AI infrastructure
- Familiarity with Kubernetes, containerized workloads, or cloud-native systems
- Experience deploying reinforcement learning or adaptive scheduling systems in production
- Background in capacity planning, forecasting, or resource allocation modeling
- Contributions to open-source ML or systems projects
- You love uncovering hidden failure patterns in massive, noisy infrastructure datasets
- You enjoy working directly with engineering teams to deploy solutions that have immediate real-world impact
- You're curious about building autonomous, agentic systems that investigate and explain system behavior
- You're an expert in reinforcement learning, predictive modeling, or large-scale data analysis
- Why CoreWeave?
- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
- We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovat
Additional Information
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com . We're proud to be a Living Wage accredited Employer.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at coreweaveu? Share your experience