Rightsize workloads for efficient resource utilization in Kubernetes and cloud service
Contribute to the development, deployment and operations for new microservices within the platform
Review technical specifications to provide guidance and help development teams drive operational excellence
Work hand-in-hand with software developers to facilitate the development and adoption of "Paved Road" solutions and DevSecOps processes
Support large-scale services across multiple environments
Assist in resolution efforts for problems ranging from infrastructure network layers to application scaling
This role includes participation in an on-call rotation - we believe in shared ownership of our platform and aim to build systems that are resilient, observable, and require minimal intervention.
Knowledge, Skills and Abilities:
4+ years of experience in DevOps, systems engineering, or a related role.
Strong experience with
Kubernetes
Helm
Python
Terraform
Linux
Git and Github
Strong experience working with at least one major cloud platform (AWS, Azure, or GCP).
Fundamental understanding of Kubernetes and Helm. Experience in building and running software systems on Kubernetes clusters in production
Hands-on experience with infrastructure provisioning and configuration using Infrastructure as Code (IaC) principles
An understanding of design for scalability, performance, efficiency and reliability.
Self-motivated and proactive, able to take ownership and deliver results.
Ability and willingness to learn about new technologies.
Effective communication with technical and non-technical stakeholders
Job Description:
DataRobot delivers AI that maximizes impact and minimizes business risk. Our platform and applications integrate into core business processes so teams can develop, deliver, and govern AI at scale. DataRobot empowers practitioners to deliver predictive and generative AI, and enables leaders to secure their AI assets. Organizations worldwide rely on DataRobot for AI that makes sense for their business - today and in the future.
We are searching for a DevOps Engineer II who enjoys working with engineers across disciplines and teams to architect efficient, reliable and scalable software systems. This role requires expertise in Kubernetes and cloud computing, as well as strong automation skills in Python, Helm and Terraform.