
Lead DevOps Engineer

Litmus · Toronto, Canada
Full-time · Remote · 2mo ago
AWS · CI/CD · Cloudflare · Edge Computing · GCP · GitLab

About the role

Litmus is building the industrial IoT platform of record, and our DevOps function is the engine that lets engineering move fast with confidence. This is a senior technical leadership role - reporting directly to the Head of Technology - for someone who is ready to own the DevOps function end-to-end across the entire company, and to lead its transformation into an AI-enabled engineering discipline. You will inherit a capable, distributed team and a meaningful technical foundation: self-hosted GitLab for CI/CD, multi-cloud infrastructure across AWS and GCP, Kubernetes (EKS) workloads, and an on-premises VMware estate. Your mandate is to level up this foundation, drive down delivery friction for the broader engineering organization, and make strong technical decisions without needing direction for day-to-day operations. If you thrive at the intersection of platform engineering, cloud infrastructure, and security automation - and you want to be the person who sets the standard - this role is for you.

Responsibilities

Technical Leadership & Team

  • Lead and mentor a distributed DevOps team spanning North America and India, including an infrastructure security-focused sub-team.
  • Serve as the primary technical decision-maker for the DevOps function - architecture, tooling choices, prioritization, and delivery standards.
  • Partner with Engineering, QA, and Product leadership to reduce delivery friction and improve DORA metrics (lead time, deployment frequency, MTTR, change fail rate).
  • Represent the DevOps function at the leadership level, including communicating roadmap, risks, and platform health to the Head of Technology and broader Technology leadership.

CI/CD Platform (GitLab)

  • Own the self-hosted GitLab platform - upgrades, runner fleet management (VMware-hosted and cloud), and platform health.
  • Drive maturity of the CI/CD Catalog and shared template library (ci-common/gitlab-templates), ensuring teams can self-serve without bespoke pipeline configuration.
  • Evolve pipeline capabilities: container image scanning, IaC static analysis, SAST, SBOM/CVE generation, and MR-triggered security scans.
  • Establish and enforce merge request standards, branch protection policies, and CODEOWNERS governance across the GitLab organization.

Kubernetes & Cloud Infrastructure

  • Own EKS day-2 operations: cluster upgrades, node group management, networking (private API endpoints, Cloudflare tunnel integration), and reliabi

Benefits

Health insurance

Additional Information

Who is Litmus

Litmus is building the data foundation that powers industrial AI. AI doesn't work without real-world, contextualized data - Litmus makes that data usable. As AI adoption accelerates, most industrial environments still can't access or use their operational data. We solve that gap.

We're a growth-stage software company helping manufacturers access, structure, and use real-time data from machines, systems, and sensors at the edge. Our platform sits at the intersection of edge computing, AI, and industrial operations, enabling some of the world's largest companies to run operations in real time, reduce downtime, and optimize production. Backed by leading investors and trusted by global manufacturers and partners like Google, Microsoft, Dell, Oracle, and Mitsubishi, Litmus is powering the shift toward software-defined manufacturing.

Why join Litmus

Build the infrastructure that makes industrial AI possible

AI is moving beyond the cloud and into the physical world. At Litmus, you'll build the infrastructure that enables real-time data to power AI and machine learning systems in production environments.

Work on problems where software meets the real world

Most AI systems fail without access to real-world data. You'll build the layer that makes them viable in production. We solve challenges at the intersection of distributed systems, real-time data, and industrial constraints - where reliability, scale, and performance are non-negotiable.

Have real impact, fast

You'll work on systems used by real customers in production, with direct impact on product and company trajectory. As a scaling company, we move quickly. You'll have ownership, visibility, and the ability to shape both product and company as we scale.

Join a high-performance team

We're building a team that holds a high bar and pushes each other to improve. You'll work alongside experienced operators, engineers, and leaders who have done this before and are building again at scale. We hire people who take ownership, move quickly, and care about outcomes. No passengers.

Our culture

At Litmus, the team is collaborative, curious, and low ego. People are scrappy, take ownership, and look for ways to make an impact. We value empathy just as much as execution, whether that's in how we build, how we communicate, or how we support each other. We're a growing company, so things move quickly and not everything is perfectly defined. If you enjoy figuring things out, working closely with others, and making steady progress, you'll do well here.

