Senior Site Reliability Engineer - Security

Scopely · Bangalore Urban, India
Full-time · On-site · Posted 1mo ago (30+ days old, may be filled)
Python · Go · Rails · AWS · Terraform · CI/CD

Responsibilities

  • Design and operate observability layers for AI platforms, including audit trails, tool-call logs, correlation IDs, traces, and runtime visibility across service boundaries.
  • Build automated findings-to-fix loops for AI and cloud platforms, integrating signals from CSPM tooling or future AI security products into pragmatic remediation workflows.
  • Implement reliability and hardening controls for internal AI systems, including alerting, health checks, rollback drills, kill-switch validation, rate limiting, and drift detection.
  • Codify detections, policies, and operational checks as code where they reduce toil, prevent regressions, and improve platform control.
  • Review platform and AI-application changes from a reliability and application-hardening perspective, especially around secrets, telemetry, external calls, risky MCP usage, and production readiness.
  • Own AI-platform-specific operational readiness and partner with central IT / SOC teams for escalations, postmortems, and shared incident workflows.
  • Continuously improve production readiness through automation, post-incident learning, and repeatable playbooks for AI runtime issues.

Requirements

  • 5+ years in SRE, production engineering, platform operations, or security automation with strong coding ability.
  • Hands-on scripting and coding experience, especially Python, with comfort working against APIs, log pipelines, and automation workflows.
  • Experience building pragmatic observability and alerting systems in AWS or comparable cloud environments.
  • Ability to reduce operational toil through automation while keeping signal quality high and false positives manageable.
  • Comfortable with incident handling, rollback thinking, SLA / SLO discussions, and evidence-driven postmortems.
  • Interest in AI systems, agent runtimes, and MCP-style integration risks is highly valuable.

Bonus

  • Software engineering background beyond scripting, including code review and testing habits.
  • Experience with AI agent runtimes, prompt / tool telemetry, or internal platform hardening for LLM-powered systems.
  • Experience with privacy-aware telemetry, compliance-oriented logging, or runtime protection.

About Scopely

Additional Information

Scopely is looking for a Site Reliability Engineer - Security to join our Gen AI team in Bangalore! At Scopely, we care deeply about what we do and want to inspire play, every day - whether in our work environments alongside our talented colleagues, or through our deep connections with our communities of players. We are a global team of game lovers who are developing, publishing and innovating in the mobile games industry, connecting millions of people around the world daily.

For this role, we are seeking an experienced SRE focused on observability, automation, and runtime reliability for AI platforms and internal agentic systems. This is not a generic SOC role. It is an engineering role for someone who builds telemetry, automates findings-to-fix loops, improves production readiness, and keeps AI systems measurable, resilient, and controllable in production.

Suitable backgrounds

  • Site Reliability Engineers or backend engineers with strong automation skills
  • Platform, DevSecOps, or observability engineers who build tooling, not just dashboards
  • Cloud automation engineers with strong logging, tracing, and incident-response instincts
  • Detection or security automation engineers who prefer code, pipelines, and remediation over ticket operations

Tech stack

  • Python for automation and workflow integration
  • Infrastructure-as-code (Terraform or Pulumi)
  • Observability concepts: metrics, logs, traces
  • AWS logging, telemetry, IAM-aware diagnostics, and infrastructure scripting
  • CI/CD integration for runtime checks, rollback drills, and policy validation
  • Nice to have: Wiz, CrowdStrike, Orca, GuardDuty, WAF / RASP-style controls, MCP / agent telemetry

