Skip to main content
Back to jobs

Senior SRE/DevOps (Platform Tribe)

External
playson logoPlayson · European Union
Full-timeRemote18mo ago
ArgoCDAWSCI/CDDatadogDockerGit
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Playson is a globally recognised iGaming supplier delivering a high-performance, microservices-based platform designed to process billions of financial transactions every day. Operating at high scale (5-7k RPS), our cross-regional infrastructure is built to deliver near-zero latency and a seamless player experience under constant load. We're looking for a Senior SRE / DevOps Engineer to join our Platform Tribe - a lean & senior team where ownership is high and expectations are even higher. This is a deeply hands-on role at the core of a high-traffic system, where you'll be directly responsible for maintaining reliability, performance, and stability in a fast-paced environment. You'll be working on real-time production challenges, handling incidents, managing alerts, and being part of a critical on-call rotation. This role requires resilience, strong decision-making under pressure, and a proactive mindset to continuously improve systems operating at scale. If you thrive in high-load environments, enjoy solving complex production issues, and want to have a direct impact on systems used by millions - this is the place for you.

Responsibilities

  • Own system reliability by actively monitoring platform health, managing alerts, and responding to incidents in real time
  • Participate in 24/7 on-call rotations, taking full ownership of production stability in a high-traffic (5-7k RPS) environment
  • Investigate incidents, perform root cause analysis, and implement long-term fixes to prevent recurrence
  • Build and continuously improve monitoring, alerting, and observability across the Kubernetes (EKS) ecosystem
  • Deploy, manage, and optimise infrastructure using Terraform, Helm, and GitOps tools (Flux/ArgoCD)
  • Drive automation and proactively improve system resilience, reducing manual intervention and recurring issues
  • Maintain and evolve CI/CD pipelines and infrastructure-as-code practices
  • Collaborate closely with engineering teams to support deployments and minimise user impact in a live environment
  • Introduce and integrate new tools and technologies to enhance scalability, reliability, and performance
  • Handle environment-specific requests and ensure smooth day-to-day platform operations under constant load

Requirements

  • Strong hands-on experience with Kubernetes (deployment, scaling, troubleshooting) in high-load environments
  • Experience with GitOps tools such as FluxCD or ArgoCD
  • Proven experience in incident response, root cause analysis, and postmortems in production systems
  • Solid experience with AWS, Terraform, Docker, and CI/CD pipelines
  • Experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, and logging stacks like ELK or CloudWatch
  • Strong understanding of networking concepts and protocols
  • Proficiency in at least one scripting language (e.g. Python, Go, Node.js)
  • Experience working with version control systems (Git)
  • Familiarity with incident management tools like PagerDuty, Opsgenie, or similar
  • Ability to operate effectively in a fast-paced, high-pressure environment with strong ownership and accountability
  • Proactive, resilient mindset with a focus on continuous improvement and system stability

Benefits

Competitive Salary: We offer a competitive salary, subject to annual performance reviewsQuarterly Bonuses: Benefit from a transparent and systematic quarterly bonus systemUnlimited Paid Vacation: Enjoy unlimited paid vacation leave, including Ukrainian bank holidaysUnlimited Paid Sick Leave: Take unlimited paid sick leave whenever necessaryFlexible Schedule: We offer a flexible work schedule to accommodate your needsRemote Work: Choose to work remotely, providing greater flexibility and comfortMedical Insurance: Receive comprehensive medical insurance for both you and a significant otherFinancial Support for Life Events: We provide financial support during special life eventsProfessional Development: Get reimbursement for professional development courses and trainingInternational exposure : Attend industry expos, team gatherings & global meet-upsB2B contractsRecruitment ProcessHR Interview (30-45 min)Interview with a Product Owner (60 min)Technical interview (90 min)Final Interview with C-level (60 min)Health insurancePaid time offRemote work optionsFlexible schedulePerformance bonus

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at playson? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect