Skip to main content
Back to jobs

Senior Staff Software Engineer, Managed Orchestration

External
Crusoe logoCrusoe · San Francisco, CA
Full-timeOn-site2mo ago
CI/CDGCPKubernetesLeadershipLinuxPerformance Optimization
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

We are actively seeking an exceptional Senior Staff Software Engineer for our cloud software team who will serve as a technical leader and strategic force behind the operations of our cutting-edge infrastructure. Your expertise will be instrumental in defining, designing, and scaling our carbon-reducing operating model, as well as driving the long-term vision and reliability of critical hardware, software, and network components. In this role, you will lead complex architectural initiatives, set technical direction across teams, and establish best practices in code quality, system design, and operational excellence. You will author and review high-impact proposals and architecture documents, guiding decisions that shape the organization's technical landscape. You will evaluate tools and frameworks with a long-term, system-wide perspective, carefully considering their impact on reliability, scalability, operational costs, and organizational adoption. Your deep expertise in orchestration and optimization will be instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they set the industry standard for reliability, performance, and efficiency. What You'll Be Working On: Drive the development of scalable, resilient, and high-performance software solutions, ensuring alignment with and influence over the strategic objectives outlined in the Crusoe Cloud roadmap Provide technical leadership across multiple teams, fostering a culture of innovation, engineering excellence, and accountability while enabling teams to deliver cutting-edge cloud solutions Define and evolve architectural standards and best practices, ensuring consistency, scalability, and long-term maintainability across systems Continuously stay ahead of emerging trends and technologies in cloud software, proactively shaping Crusoe's technical direction and incorporating innovations that maintain competitive advantage Act as a mentor and multiplier for engineering talent, elevating team capabilities through coaching, design reviews, and thought leadership in technical discussions Lead cross-functional initiatives and drive alignment between engineering, product, and infrastructure teams to deliver cohesive and impactful solutions What You'll Bring to the Team: You have 10+ years of experience working in software engineering, with deep expertise in Systems Engineering and large-scale distributed systems You possess 3+ years of programming experience in GoLang, with a track record of delivering production-grade systems You have extensive experience with Kubernetes and Linux Engineering, including advanced debugging and performance optimization You are highly skilled in infrastructure as code and have a strong understanding of complex systems-level challenges at scale You have experience with Terraform and GCP (preferred), with the ability to influence platform-level decisions You have a strong understanding of Argo, CI/CD, and Automated Testing pipelines, including designing and scaling them for large organizations You can architect, build, and evolve Kubernetes operators and controllers, owning critical components that ensure the reliability, scalability, and efficiency of the Kubernetes environment You have experience designing and operating large-scale systems comparable to leading services like Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS) You can lead and deliver critical, high-impact projects, driving initiatives across networking, quality control, automation, and system reliability at an organizational level You can define and own system architecture end-to-end, including CI/CD pipelines, ensuring scalability, security, and long-term sustainability You have exceptional communication skills, with the ab

Benefits

Vision insurance

Additional Information

Crusoe is on a mission to accelerate the abundance of energy and intelligence . As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Crusoe? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect