Skip to main content
Back to jobs

Staff Software Engineer, Inference

External
coreweaveu logoCoreweaveu · Warsaw, Poland
Full-timeOn-site2d ago
CachingCapacity PlanningComplianceGDPRKubernetesLeadership
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Technical Leadership & Architecture
  • Define and drive multi-service, cross-team architecture for CoreWeave's inference platform.
  • Lead complex design initiatives spanning request routing, adaptive scheduling, GPU resource management, and cost-per-token optimization .
  • Set technical direction for low-latency, high-throughput inference systems operating under strict P99 SLAs.
  • Performance & Optimization
  • Lead development of advanced inference optimizations, including:
  • micro-batching and dynamic scheduling
  • speculative decoding
  • KV-cache reuse and memory optimization
  • Drive measurable improvements in P95/P99 latency, throughput, and GPU utilization across the platform.
  • Establish performance benchmarking frameworks and guide teams in data-driven optimization strategies .
  • Reliability & Systems Excellence
  • Own system-wide SLIs/SLOs and ensure reliability improvements across releases.
  • Lead efforts in capacity planning, autoscaling strategies, traffic management, and failure mitigation .
  • Drive post-incident analysis and ensure systemic improvements across teams.
  • Cross-Team Influence
  • Partner with Product, Orchestration, Networking, Storage, and Hardware teams to deliver a cohesive inference platform .
  • Influence technical decisions across org boundaries and ensure alignment on architecture, performance, and scalability goals .
  • Serve as a go-to technical expert for inference systems and distributed AI workloads.
  • Mentorship & Engineering Excellence
  • Mentor senior and mid-level engineers; elevate design, coding, and operational standards across teams.
  • Lead cross-team design reviews and ensure high-quality system thinking at scale.
  • Raise the bar for engineering rigor, testing, and observability practices .

Requirements

  • ~8-12+ years of experience building large-scale distributed systems or cloud platforms
  • Proven track record of leading cross-team technical initiatives at scale
  • Strong coding skills in Go, Python, or C++
  • Deep expertise in Kubernetes at production scale , including orchestration, scheduling, and service design
  • Strong understanding of networked systems, performance optimization, and system design
  • Bachelor's degree in Computer Science , Engineering, or a related technical field
  • Inference & Performance Expertise
  • Hands-on experience with inference systems, including:
  • batching and micro-batching strategies
  • caching and memory optimization
  • mixed precision (BF16/FP8)
  • streaming token delivery
  • Demonstrated ability to improve tail latency (P95/P99) and system reliability through metrics-driven engineering
  • Contributions to inference frameworks such as vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe
  • Experience with GPU systems and performance optimization (CUDA, NCCL, RDMA, NUMA, GPU interconnects)
  • Experience leading multi-team or org-level initiatives
  • Exposure to large-scale AI/ML infrastructure or hyperscale cloud environments
  • To fulfill our obligation to protect client data, successful applicants offered employment with CoreWeave will be required to complete a basic criminal record check, conducted in compliance with GDPR. Employment offers are conditional upon receiving satisfactory check results

Benefits

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:Family-level Medical InsuranceFamily-level Dental InsuranceGenerous Pension ContributionLife Assurance at 4x SalaryCritical Illness CoverEmployee Assistance ProgrammeTuition ReimbursementWork culture focused on innovative disruptionBenefits may vary by location.Equal OpportunityRecrDental insuranceEquity / stock optionsPerformance bonus

Additional Information

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com . We're proud to be a Living Wage accredited Employer.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at coreweaveu? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect