Design, build, ship, and maintain the core observability libraries, tools, and patterns used by all of Checkr's engineering teams
Troubleshoot complex production issues across the stack, with respect to performance, availability, and data quality
Participate in a cross-organization incident response team, driving continuous improvement
Contribute to architectural discussions within the SRE team and with cross-functional teams
Influence cross-team projects and the reliability roadmap to enable engineering and help Checkr customers
Provide consultation and feedback across teams to ensure we are building highly reliable, efficient, and scalable systems
What you bring:
Bachelor's degree in Computer Science or related field, or equivalent practical experience
2+ years of software engineering experience, including 1+ years focused on reliability, scalability, and efficiency of distributed systems
Proficiency in Python (preferred), Go, or Ruby within Linux environments, and strong understanding of microservices, asynchronous systems, and remote APIs.
Experience developing and operating production, customer-facing systems in AWS or Azure using Kubernetes, Docker, and Terraform.
Skilled in observability and incident response practices using tools such as Datadog, Splunk, Grafana, Prometheus, and OpenTelemetry, with a focus on continuous improvement.
Strong collaboration, documentation, and communication skills, with experience leading small projects, promoting platform adoption, and fostering a self-service, product-first mindset.
An A-player mindset with a strong bias for action: you raise the bar, move with urgency, stay resilient through ambiguity, and take ownership to deliver meaningful outcomes.
Pay Transparency Disclosure
On-target Earnings OR Base Salary range (San Francisco, CA)
$150,000 - $176,000 USD
On-target Earnings OR Base Salary range (Denver, CO)
$127,000 - $149,000 USD
Benefits
A fast-paced and collaborative environmentLearning and development allowanceCompetitive cash and equity compensation, and opportunity for advancement100% medical, dental, and vision coverageUp to $25K reimbursement for fertility, adoption, and parental planning servicesFlexible PTO policyMonthly wellness stipendEqual Employment Opportunities at CheckrApplicant Privacy PolicyIf you are a California resident or are located in Alberta or British Columbia, our Applicant Privacy Policy applies to our collection and processing of your personal information when you apply for a role with us or otherwiseDental insuranceVision insurancePaid time offRemote work optionsFlexible scheduleEquity / stock optionsParental leave
Additional Information
About Checkr
Checkr is building the data platform to power safe and fair decisions. Over 140,000 companies and millions of people rely on Checkr for AI verification in the moments that matter most: getting a new job, a new place to live, a car ride, childcare, even a date. Customers include Uber, Pennymac, Airbnb, Doordash, Amazon, and Anthropic.
We're a team that thrives on solving complex problems with innovative solutions that advance our mission. Checkr is recognized on Forbes Cloud 100 2025 List and is a Y Combinator 2024 Breakthrough Company .
As a Software Engineer II on the Site Reliability Engineering team within the Platform Engineering group at Checkr, you will identify reliability challenges impacting engineering teams and platforms and develop innovative solutions to resolve them. You will strive towards the right balance between enforcing standardization and accommodating tailored workflows. The person in this role will have the opportunity to demonstrate a degree of autonomy in their work and help with complex support requests, while having an impact across engineering.