Platform Engineer - (Site Reliability Engineering)

External
Bitso · Latin America
Full-time · On-site · 1d ago
CI/CD · Incident Response · Java · Kubernetes · Leadership · Observability

About the role

With over 9 million users, Bitso is the leading cryptocurrency platform in Latin America. We are developing the cryptocurrency ecosystem in the region and enabling financial inclusion. We believe crypto is the future of finance, and we're committed to making it useful by providing equal access to safe and intuitive financial products.

When we hire people for our team, we specifically test for the following traits in addition to our cultural values:

  • Mission-Driven: We seek individuals who are passionate about crypto and Bitso's mission and resilient in facing industry challenges.
  • High Sense of Urgency: We prioritize candidates who demonstrate a high sense of urgency and responsibility.
  • Exceptional Hard Skills: We seek individuals who possess exceptional skills in their respective fields, with no room for mediocrity.
  • Self-Management: We look for individuals who can independently manage their work, career, and professional development.

Compensation & Benefits

At Bitso, you are taking the front seat on the edge of crypto innovation, creating the next generation of crypto-powered products. So for those willing to commit, adapt and pioneer the most important change of the century we offer:

  • Me Time program, including unlimited paid time off.
  • Remote-first work environment.
  • Employee Stock Option program.
  • Zero trading fees through our Bitso Alpha app.
  • Extended Family Leave Policy: all birthing parents,

Responsibilities

  • Own and execute on-call shifts end-to-end: acknowledge pages within SLA, declare incidents, assign roles, maintain comms cadence, and drive to resolution
  • Build automation that drives the Sev1/Sev2 postmortem workflow - from scheduling and facilitation reminders to action-item assignment, ownership tracking, and due-date enforcement
  • Leverage AI to identify patterns across incidents and propose systemic fixes: runbook improvements, alert tuning, platform hardening, and process changes
  • Build and extend internal automation and tooling, including AI-assisted incident response workflows, to reduce manual toil and accelerate detection and resolution
  • Contribute to and improve the observability ecosystem - dashboards, alert configurations, and early-warning signals across Bitso's platform
  • Participate in change and maintenance management processes, applying risk management to reduce deployment-related incidents
  • Collaborate with engineering squads across the company to surface platform risks and drive preventive actions
  • Keep incident tooling, runbooks, and severity criteria accurate, current, and useful for the broader engineering org

Requirements

  • Proven ability to operate confidently in high-pressure incident scenarios, including communicating clearly with senior stakeholders and leadership while a production issue is live
  • Hands-on experience with Kubernetes - comfortable deploying, debugging, and navigating pod-level issues
  • Solid understanding of CI/CD pipelines and modern DevOps practices
  • Software development background in any language; ability to read, write, and debug code is essential (Python or Java experience is a plus)
  • Strong automation mindset: you identify repetitive toil and your first instinct is to eliminate it, not absorb it
  • Experience building or working with AI agents or LLM-based workflows is highly desirable
  • Strong interpersonal and written communication skills
  • Self-directed learner who doesn't need a fully defined path to start contributing
  • Fintech or crypto industry background is a plus - familiarity with the domain vocabulary accelerates onboarding and incident triage

Benefits

  • Paid time off
  • Remote work options
  • Equity / stock options

Additional Information

Working At Bitso

We are a diverse team that takes pride in understanding the perspectives of others. We fully embrace working remotely and we are eager to act, improve and accelerate progress inside and outside of our organization. To drive revolutionary changes in society and make crypto useful, we delight our customers with world-class products, deep care, and intentional empathy.

Your Purpose

At Bitso, reliability isn't an afterthought - it's a competitive advantage. As a Platform Engineer 2 focused on Incident Management, you'll own the full incident lifecycle: from active response during live incidents, to driving postmortems, building automation, and eliminating the root causes that create toil in the first place. You'll be the person who asks "how do we make sure this never happens again?" - and then actually builds it. If you thrive under pressure, love automation, and want to make a measurable dent in how a high-scale crypto platform operates, this role was designed for you.

Reports To

Incident Management Manager


Interested in this role?

Apply on the company's website.
