Skip to main content
Back to jobs

Senior Software Engineer, Site Reliability

External
Upstart logoUpstart · US
Full-timeRemote3w ago
AgileCI/CDCloudFormationDatadogIncident ResponseJavaScript
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Requirements

  • Minimum requirements:
  • Minimum of 6 years combined experience between Software Engineering, Site Reliability, and/or DevOps Engineering including CI/CD, TDD, internal tooling, observability, and other agile development practices
  • Proficiency coding Python, Go, JavaScript/TypeScript
  • Proficiency with Infrastructure as Code (Terraform, CDK, Cloudformation, etc.)
  • Software engineering background with experience building internal tooling from scratch, and other agile development techniques
  • Strong software design & architecture skills
  • Fundamentally sound with data structures & algorithms
  • Experience with on-call and incident management environments
  • Experience with observability, monitoring, and reporting tools (e.g., Datadog, Sumologic, , etc.)
  • Experience supporting SaaS software in a microservice-oriented cloud environment
  • Ability to work with multiple teams for enterprise-wide deliverables
  • Data/metrics-driven mindset
  • Experience with service mesh
  • Full Stack development skills
  • Experience building tooling for an observability platform
  • Experience leveraging LLM/GenAI to improve SRE efficiency and processes
  • Position Location - This role is available in the following locations: Remote, San Mateo, Columbus, Austin, NY
  • Time Zone Requirements - This team operates across all U.S. time zones.
  • Travel Requirements - This team has regular on-site collaboration sessions. These occur 3 days per quarter at an Upstart office. If you need to travel to make these meetups, Upstart will cover all travel related expenses.
  • At Upstart, your base pay is one part of your total compensation package. The anticipated base salary for this position is expected to be within the below range. Your actual base pay will depend on your geographic location-with o

Benefits

Health insuranceRemote work options

Additional Information

About Upstart At Upstart, we're united by a mission that matters: to radically reduce the cost and complexity of borrowing for all Americans. Every day, we bring creativity, experimentation, and advanced AI to reshape access to credit, helping millions move forward financially with clarity and confidence. As the leading AI lending marketplace, we partner with banks and credit unions to expand access to affordable credit through technology that's both radically intelligent and deeply human. Our platform runs over one million predictions per borrower using more than 1,800 signals, powering smarter, fairer decisions for millions of customers. But the numbers only hint at the impact. Every idea, every voice, and every contribution moves us closer to a world where credit never stands between people and their financial progress. We're proudly digital-first, giving most Upstarters the flexibility to do their best work from wherever they thrive, alongside teammates across 80+ cities in the US and Canada. Digital-first doesn't mean distant. We're intentional about in-person connection through team onsites, planning sessions, and moments that spark creativity and trust. And whether you choose to work primarily from home or collaborate in-person from one of our offices in Columbus, Austin, the Bay Area, or New York City (opening Summer 2026), you'll have the support to work in the way that works best for you. If you're energized by tackling meaningful problems, excited to innovate with purpose, and motivated by work that truly matters, we'd love to hear from you. The Team Upstart's Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart's production systems. The SRE team builds tooling and automation to monitor the health of our infrastructure and create a fast, reliable, and productive environment for other engineers and a world-class experience for our customers. SRE defines Upstart's strategy for technology operations risk mitigation, which includes disaster planning and on-call procedures. We use data-driven approaches to drive our decisions, and provide reports and insights to the business to improve visibility into the system and customer experience. As a Senior Software Engineer focused on Site Reliability Tooling , your work will directly impact the success of the SRE team and all of Upstart. Your expertise will inform the team's direction, and your work with other SREs and Upstart engineers will make Upstart's systems as effective as possible for our customers. SRE at Upstart is ever-changing, and you will be a primary contributor in shaping our future path. How you'll make an impact: Embody and share SRE principles at Upstart Exercise state-of-the-art SRE practices throughout the company Uphold a culture of visibility, ownership, and responsibility around service reliability Implement standards for monitoring microservices, web apps, mobile apps, databases, Kubernetes clusters, and machine learning platforms, in a fast-paced environment Improve incident response practices, both within SRE and throughout the company Automate away toil that make sense to be automated


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Upstart? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect