Site Reliability Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Own the migration from Heroku to Google Cloud Platform: architecture, execution, and a cutover that doesn't surprise anyone
- Build and maintain the Postgres core, Fivetran pipeline: BigQuery data layer, and Hex reporting infrastructure
- Optimize the hot paths that matter most: key backend code paths and our heaviest third, party syncs, so performance holds as volume climbs
- Own monitoring, alerting, cost reduction, and proactive scaling: surface problems early, keep spend sane, and stay ahead of growth rather than reacting to it
- Lead incident response and write post-mortems that turn an outage into a permanent fix and a smarter team
- Set the operational bar across engineering and pull others up to it
- WHAT YOU HAVE
- Production reliability ownership:
Additional Information
Tern's user base is about to triple. Large host agencies are coming on board this year, and the infrastructure needs to be ready before they arrive. If you want to own the migration, build the monitoring, and be the person every other engineer at Tern depends on, this is your role. ABOUT TERN Tern is a venture-backed software company on a mission to reshape the $127B travel agency industry by giving power back to the entrepreneurs who built it. Nearly 98% of travel agencies are small businesses. These businesses have been chronically underserved by technology. We're here to change that. Our platform helps travel advisors run more efficient, professional, and profitable operations, giving them the modern infrastructure they need to lead the next chapter of travel. But the impact goes beyond business. Travel advisors help clients move more intentionally through the world. When a traveler works with an advisor, they're more likely to avoid overtouristed hotspots and more likely to spend their dollars in places where they can do real good. That's the kind of travel we want more of. At Tern, we believe in small business. We believe in the power of travel. And we're building the future of both. SITE RELIABILITY ENGINEER Everything Tern ships rides on the infrastructure underneath it. We're a Ruby on Rails application on Heroku, migrating to Google Cloud Platform, with a Postgres core and a data pipeline through Fivetran, BigQuery, and Hex. It's a solid foundation. It won't scale on its own. We're preparing for a major step-up in load, large host agencies coming on board, user base tripling, and this role builds the infrastructure that holds under that growth. You'll own the migration to GCP, own the monitoring and alerting that keeps production reliable, and own the hot paths that need to get faster before volume climbs. Do it well and every other engineer at Tern moves faster and sleeps better. This is a force multiplier role at the foundation of the product. This is a player-coach role. You'll start as the hands-on technical lead for infrastructure and grow into coaching and managing alongside it. That's the expectation from day one. Why Tern? Tern is building the platform travel advisors run their entire business on, and we believe the next great aggregator in travel gets built right now, on AI. The window to win this market is open, and we intend to take it. With capital and real momentum behind us, we're growing the engineering team this summer to put fuel behind what's already working. This role owns a real part of how Tern works, the kind of work the rest of the team depends on. We hire engineers with high agency and the trust to execute independently, set direction through their work, and lift the standard of everyone around them. We hire for proven execution, so we want to see specific evidence of what you've shipped and the impact it had, in your resume and in the room. Engineering Standards Ship regularly. Every engineering pair ships a working, tested feature every week. Fast and good are not in tension. Testing is part of the build. We own our code and how users experience it. We watch what we ship in production, stay close to support, AppSignal, Bugsnag, and Canny, and own our fixes end to end. We make the people around us faster and better, through code and design reviews that teach, through clear written work, and by unblocking our colleagues quickly. Every team is an AI team. We use AI fluently in daily work, from design review to test generation to debugging. We run Claude Code with a deep library of custom skills and agents, unlimited token usage, and we're actively building agentic and MCP-based tooling on top of our own systems. This is how we move fast and build well. Our stack Tern is a Ruby on Rails application with a Hotwire front end, backed by a Postgres database and hosted on Heroku, though we are migrating to Google Cloud Platform. Our data flows through to BigQuery, where we build reporting in Hex. Claude Code, with our own library of custom skills and agents, is part of daily development. You don't need to have used every piece, but you should be fluent enough to be productive quickly and excited to work this way.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at tern? Share your experience