Staff Systems Engineer - Compute Support
About the role
Note: This position follows a hybrid work model, requiring 2 days per week on-site at our corporate office in Chicago, IL or New York, NY.
Title: Staff Systems Engineer - Compute Support
Job Summary
As a Staff Systems Engineer within the Reliability, Engineering & Operations (REO) team, you will serve as a technical linchpin for CME Group's mission-critical markets infrastructure. This role is designed for an experienced engineer who thrives at the intersection of high-scale distributed systems and cloud evolution. Operating on a Tuesday through Saturday schedule, you will provide critical support and continuity for our platforms, ensuring reliability and observability during critical market windows.
Work hours: Tuesday to Saturday, 2pm to 11pm CST
What You'll Get
- A supportive environment fostering career progression, continuous learning, and an inclusive culture.
- Broad exposure to CME's diverse products, asset classes, and cross-functional teams.
- A competitive salary and comprehensive benefits package.
Responsibilities
- Architect Resilience: Own and resolve escalated incidents within our distributed computing architecture, performing root cause analysis across client-server systems, hardware platforms, and resources (CPU, memory, virtualization, clustering, and cloud computing).
- Drive Cloud Evolution: Support the migration of markets applications and infrastructure to Google Cloud Platform (GCP).
- Engineer Observability: Partner with stakeholders to enhance alerting and observability to drive faster detection and data-driven decisions.
- Innovate through Automation: Eradicate manual toil by developing automated solutions that enhance system scalability and reliability across the enterprise.
- Lead and Mentor: Serve as a technical leader within the team, managing high-level discussions on architectural approaches and fostering the growth of junior engineers through intentional mentorship.
- Optimize Performance: Collaborate with SRE and Product teams to fine-tune system efficiency, ensuring our infrastructure meets the demands of global markets.
Requirements
- Operational Excellence: A strong background in incident response and SRE principles, with a demonstrated ability to take accountability for complex problems and change management.
- 10+ years of experience administering and troubleshooting the Linux operating system.
- Deep Technical Expertise: Proven experience managing complex distributed systems, including client-server architecture, clustering, and hardware resource optimization (CPU & Memory).
- Automation Mindset: Proficiency in automating repetitive tasks to improve reliability; familiarity with Infrastructure as Code (IaC) and modern monitoring tools is highly valued.
- Leadership & Collaboration: Exceptional communication skills with the ability to lead technical presentations and influence cross-functional stakeholders.
- Cloud Proficiency: Proficiency with Google Cloud Platform (GCP) and a solid understanding of cloud-native architecture and virtualization.
- Mentorship: A passion for developing talent and sharing knowledge within a collaborative engineering culture.
Required skills
- Deep knowledge of scripting languages such as Bash, PowerShell, or Python.
- Cloud experience, with Google Cloud preferred.
- Experience administering and troubleshooting the Linux and Windows operating systems.
- Ability to present complex, technical ideas in a clear and concise manner to non-technical audiences.
- Detailed understanding of how to build systems that monitor our infrastructure environment and provide in-depth observability into it.
CME Group: Where Futures are Made