Senior Site Reliability Engineer

Crexi · Los Angeles, CA
Full-time · On-site · 4w ago
Auditing · AWS · Azure · Budget Management · CI/CD · CloudFormation

Responsibilities

  • Leads the design and development of self-service infrastructure-as-code tooling, enabling engineering teams to manage and provision infrastructure autonomously and at scale.
  • Architects and drives automation frameworks for provisioning, configuration management, deployment, and monitoring, significantly reducing manual intervention and improving operational efficiencies across the organization.
  • Serves as a subject matter expert across a broad range of infrastructure domains including networking, storage, compute/containers, and security, providing guidance and escalation support to the broader engineering organization.
  • Leads code reviews and sets standards for infrastructure changes, ensuring quality, security, and maintainability.
  • Owns and drives architectural designs and planning discussions, influencing platform-wide decisions and the long-term roadmap.
  • Partners closely with software engineering leadership to define and improve reliability standards, SLOs, SLIs, and error budgets for software products.
  • Leads the evaluation, selection, and management of relationships with third-party service providers, ensuring reliability, security compliance, and cost-effectiveness of external services.
  • Authors and maintains comprehensive documentation for system architecture, runbooks, processes, and best practices to facilitate knowledge sharing, onboarding, and organizational resilience.
  • Owns and evolves Crexi's cloud infrastructure strategy, with a focus on scalability, security, cost optimization, and operational excellence.
  • Mentors and develops junior and mid-level SRE and infrastructure engineers through code reviews, pairing, and technical guidance.
  • Drives incident response, post-mortem processes, and reliability improvement initiatives across the engineering organization.
  • Performs other duties as assigned.

Requirements

  • BS degree in Computer Science or equivalent relevant work experience.
  • 5+ years of experience operating infrastructure, with at least 2 years in a senior or lead SRE/infrastructure capacity.
  • Advanced proficiency in a programming language, such as Python, Go, C#, or TypeScript.
  • Deep experience with infrastructure best practices including infrastructure-as-code, security auditing, cost optimization and FinOps, CI/CD automation, pipeline architecture, version control, code review, SLO/SLI/error budget management, alerting, and incident response.
  • Thorough knowledge of Linux system administration and internals.
  • Thorough knowledge of Docker/containers and Kubernetes/container orchestration, including cluster management and platform-level design.
  • Expert-level experience with infrastructure-as-code tooling such as Terraform, CloudFormation, or Pulumi.
  • Experience designing and managing database systems at scale, including performance tuning and high availability configurations.
  • Extensive experience managing cloud infrastructure, ideally AWS and at least one additional cloud provider (GCP or Azure).
  • Strong software development background with experience contributing to production-grade systems.
  • Demonstrated experience operating and scaling infrastructure in high-growth or fast-moving startup environments.
  • Experi

Benefits

  • Health insurance
  • Vision insurance

Additional Information

About Crexi

Crexi is reimagining commercial real estate with an AI-powered platform built to deliver smarter, more efficient solutions at every stage of the deal lifecycle. From real-time data and market insights generated by Crexi Intelligence, to targeted property marketing and seamless deal management through Crexi PRO, and a transparent, time-bound bidding experience with Crexi Auction, Crexi enables users to evaluate opportunities, maximize exposure, and close with speed and confidence. To date, Crexi has facilitated over $1 trillion in transactions and 8.6 billion square feet leased, and supports a growing community of more than 2 million monthly active users.

Crexi's mission is to catalyze the next generation of commercial real estate through three core pillars: Access, Innovation, and Connection. Crexi's platform democratizes CRE by providing unprecedented access to market insights and opportunities, accelerates CRE dealmaking with purpose-built technology that enhances speed and transparency, and empowers CRE professionals with a centralized platform designed for real-time collaboration and success.

About This Role

The Senior Site Reliability Engineer leads the reliability, performance, and scalability of Crexi's infrastructure. The role fabricates, builds, and maintains robust and resilient systems while driving observability and proactive health monitoring across the platform. The Senior SRE acts as an escalation point and technical authority during incidents, leading efforts to minimize downtime and optimize system performance. As a senior technical leader, the Senior SRE champions a culture of reliability, efficient post-mortems, continuous improvement, and effective deployment practices, while mentoring junior and mid-level engineers on the team.


