Skip to main content
Back to jobs

Staff Software Engineer, Observability

External
CoreWeave logoCoreweave · Livingston, NJ
$188K–$250K/yrFull-timeOn-site1w ago
ExcelGrafanaKubernetesMicroservicesObservabilityPrometheus
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

We are seeking a highly experienced Staff Software Engineer to lead our efforts in building, maintaining, and optimizing highly scalable, reliable, and secure systems. The Observability team is responsible for deploying and maintaining critical infrastructure at CoreWeave including our logging, tracing, and metrics platforms as well as the pipelines that feed them.

Responsibilities

  • Lead and mentor engineers, fostering a culture of collaboration and continuous improvement.
  • Scale logging, tracing, and metrics platforms to support a global datacenter footprint.
  • Develop and refine monitoring and alerting to enhance system reliability.
  • Advise engineers across CoreWeave on optimal usage of Observability systems.
  • Automate interactions with CoreWeave's Compute Infrastructure layer.
  • Manage production clusters and ensure development teams follow best practices for deployments.
  • Required Qualifications:
  • 7+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field.
  • Deep expertise across all observability pillars using tools like ClickHouse, Elastic, Loki, Victoria Metrics, Prometheus, Thanos and/or Grafana.
  • Expertise in Kubernetes, containerization, and microservices architectures.
  • Proven track record of leading incident management and post-mortem analysis.
  • Excellent problem-solving, analytical, and communication skills.

Requirements

  • Experience running and scaling observability tools as a cloud provider .
  • Experience administering large-scale kubernetes clusters.
  • Deep understanding of data-streaming systems.

Benefits

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:Medical, dental, and vision insurance - 100% paid for by CoreWeaveCompany-paid Life InsuranceVoluntary supplemental life insuranceShort and long-term disability insuranceFlexible Spending AccountHealth Savings AccountTuition ReimbursementAbility to Participate in Employee Stock Purchase Program (ESPP)Mental Wellness Benefits through Spring HealthFamily-Forming support provided by CarrotPaid Parental LeaveFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our office and data center locationsA casual work environmentA work culture focused on innovative disruptionCalifornia ApplicantsCalifornia Consumer Privacy ActEqual Opportunity & AccommodationsCoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. AllHealth insuranceDental insuranceVision insurance401(k)Paid time offFlexible scheduleEquity / stock optionsPerformance bonusParental leave

Additional Information

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com . CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI. Our technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024. As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you're someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry. CoreWeave powers the creation and delivery of the intelligence that drives innovation.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at CoreWeave? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect