Senior Software Engineer, Observability Insights
External | $165K–$242K/yr | Full-time | On-site | 3w ago
Agile, Grafana, Kubernetes, LangChain, Leadership, Move
Responsibilities
- Join CoreWeave's Observability team, where we are building the next-generation insights layer for AI systems. Our team empowers internal and external users to understand, troubleshoot, and optimize complex AI workloads by transforming telemetry into actionable insights.
Requirements
- 6+ years of experience in software or infrastructure engineering building production-grade backend systems and distributed APIs.
- Strong focus on developer-facing infrastructure, with a customer-obsessed approach to SDKs, CLIs, and APIs.
- Proficient in reliability engineering, including fault-tolerant design, SLOs, error budgets, and multi-tenant system resilience.
- Familiar with observability systems such as ClickHouse, Loki, VictoriaMetrics, Prometheus, and Grafana.
- Experienced in agentic applications or LLM-based features, including grounding, tool calling, and operational safety.
- Comfortable writing production code primarily in Go, with the ability to integrate Python components when needed.
- Collaborative experience in agile teams delivering end-to-end telemetry-to-insights pipelines.
Preferred:
- Experience operating Kubernetes clusters at scale, especially for AI workloads.
- Hands-on experience with logging, tracing, and metrics platforms in production, with deep knowledge of cardinality, indexing, and query optimization.
- Experienced in running distributed systems or API services at cloud scale, including event streaming and data pipeline management.
- Familiarity with LLM frameworks, MCP, and agentic tooling (e.g., LangChain, AgentCore).
- You love transforming complex telemetry into actionable insights.
- You're curious about agentic interfaces and the future of AI observability.
- You're an expert in building scalable, reliable systems that empower developers and customers alike.
Why CoreWeave?
- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
Benefits
The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate, which can include a variety of factors. These include qualifications, experience, interview performance, and location.
In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other
- Equity / stock options
- Performance bonus
Additional Information
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com.