Skip to main content
Back to jobs

Command Center Systems Engineer

External
CoreWeave logoCoreweave · Kenilworth, NJ
Full-timeOn-site1w ago
AgileDocumentationGrafanaIncident ResponseJiraLeadership
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

As the Command Center Systems & Automation Engineer, you are the technical engine behind our operational platform. You will own the tools, integrations, and automation that give our operators the intelligence they need to act fast and accurately. From enhancing our unified monitoring platform to refining alert intelligence and automating repetitive tasks, your work directly reduces risk and improves uptime across CoreWeave's global infrastructure. Own the evolution of the Command Center's "Single Pane of Glass", integrating signals from DCIM, EPMS, BMS, and other infrastructure platforms into a unified, real-time operational view. Transition alerting from threshold-based noise to intelligent, correlated logic using platforms such as Grafana, Prometheus, Ignition, or equivalent tools. Automate manual operator workflows, including reporting, vendor check-ins, initial triage, and ticket creation. Design and maintain data visualization dashboards and KPI reporting tools that give operators and leadership clear, actionable insight into fleet health and performance. Integrate and optimize Jira workflows to accelerate incident response, change management tracking, and task automation. Build and maintain technical runbooks and automated validation tools to ensure SOPs and MOPs are digitally enforced, not just documented. Engineer tracking for MTTD and MTTR, enabling data-driven performance improvement and RCA support. Partner with IT and Facilities Engineering teams to define system ownership boundaries and integration standards for in-scope platforms (DCIM, Ignition, BMS/EPMS).

Requirements

  • 5+ years of experience in Data Center Operations, Site Reliability Engineering (SRE), Network Operations, or Advanced Manufacturing, in a mission-critical environment.
  • Proficient in Python, Go, or other scripting languages for infrastructure automation and workflow orchestration.
  • Hands-on experience with enterprise monitoring and observability platforms (Grafana, Prometheus, Ignition, or similar).
  • Familiarity with SCADA, BMS, EPMS, and BACnet-type systems and how they integrate with operational tooling.
  • Skilled in SQL or NoSQL databases for operational reporting and KPI dashboard development.
  • Experience with time series data, telemetry platforms, and data visualization best practices.
  • Strong technical documentation skills, able to create high-fidelity SOPs and MOPs for high-availability environments.
  • Preferred:
  • Experience in hyperscale or AI infrastructure environments.
  • Familiarity with historian/telemetry platforms (Ignition or similar) and time series data pipelines.
  • Background in Agile development practices, able to work iteratively and deliver incremental improvements.
  • Deep understanding of physical infrastructure (Power, Cooling, Networking) as it relates to high-density GPU workloads.
  • Experience designing Jira project structures and automations for complex operational workflows.
  • Lean or Six Sigma methodology exposure focused on process automation and efficiency.
  • You love to build the "connective tissue" that makes complex systems feel simple.
  • You're curious about how to turn a mountain of raw data into a single, actionable alert.
  • You're an expert in staying calm under pressure and leading teams through high-severity incidents.
  • Why CoreWeave?
  • Be Curious at Your Core
  • Act Like an Owner
  • Empower Employees
  • Deliver Best-in-Cl

Benefits

Health insurance

Additional Information

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com .


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at CoreWeave? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect