Senior Software Engineer (Platform Data Reliability & Automation)
Responsibilities
- Design and implement Infrastructure as Code (IaC) and automate the provisioning, monitoring, scaling, and lifecycle management of NoSQL, Streaming, and Caching platforms (e.g., Cassandra, Aerospike, Kafka, Redis).
- Drive end-to-end automation to enable repeatable, reliable, and self-service deployment of data services across cloud and hybrid environments.
- Ensure high availability, scalability, and resiliency of the platform data solutions.
- Define and enforce SLIs, SLOs, and error margins for data platforms to drive reliability engineering practices.
- Build highly performant, self-healing systems, automated failover, and auto scaling solutions for databases and streaming platforms.
- Develop observability solutions (metrics, logging, tracing) for Cassandra, Aerospike, Redis, and Kafka/MSK to ensure proactive issue detection.
- Partner with engineering and platform teams to provide reliable, scalable, and performant data services.
- Lead incident response for critical database/caching/streaming issues and drive root cause analysis with permanent automated fixes.
- Explore and apply AI-driven approaches to automation (e.g., anomaly detection, predictive scaling, automated remediation) to enhance operational efficiency.
- Drive and implement best practices, procedures, and operational playbooks to facilitate knowledge sharing and support continuous improvement across global teams.
- Mentor junior engineers and influence best practices in automation, distributed systems, and database reliability.
Requirements
- Bachelor's or Master's degree in Computer Science or a related field.
- 6+ years of software development and DBRE experience, with at least 3 years focused on Go and Infrastructure as Code with an emphasis on automation.
- Deep proficiency in Go (Golang), with the ability to write performant, idiomatic, and maintainable code for production-scale systems.
- Proven experience designing modular, domain-driven architectures in Go, supporting large and complex backend services.
- Expertise with infrastructure-as-code tools such as Terraform and Ansible.
- Deep expertise operating large-scale NoSQL, caching, and streaming platforms (Apache Kafka, Redis, AWS MSK, etc.), including tuning, compaction strategies, repair operations, backup/recovery, and performance optimization.
- Solid understandin
Additional Information
Why Sony Interactive Entertainment?
Sony Interactive Entertainment isn't just the Best Place to Play - it's also the Best Place to Work. Sony Interactive Entertainment (SIE) is the company behind the PlayStation brand. As a subsidiary of Sony Group Corporation, we're part of a proud legacy of innovation and excellence. SIE is a dynamic technology company, delivering cutting-edge hardware and network services to more than 100 million people and an entertainment leader, home to some of the most beloved and recognizable intellectual properties (IP) in the world. Our role at SIE is to create and nurture the experiences under the PlayStation brand, a name synonymous with entertainment excellence and creativity.
Ready to level up your career? Join PlayStation as a Senior Software Engineer focused on Platform Data Reliability and Automation and help drive innovation at the forefront of interactive entertainment. You'll be part of a world-class engineering team focused on building seamless, scalable experiences for millions of players around the globe. At PlayStation, we're known not only for delivering exceptional gaming experiences but also for fostering a top-tier engineering environment centered on innovation, creativity, and technical excellence. We welcome passionate engineers who thrive on solving complex problems and are aligned with our vision of shaping the future of play.
Role Overview:
We are seeking a Senior Software Engineer (Platform Data Reliability & Automation) to play a critical role in building, automating, and operating scalable data platforms with a strong emphasis on Infrastructure as Code (IaC) and cloud technologies. This role focuses on the reliability and automation of NoSQL, Streaming, and Caching services across AWS and GCP environments.
You'll design robust automation frameworks, ensure high availability, and partner with product and platform teams to deliver resilient, highly available infrastructure supporting billions of transactions and millions of players globally. By embracing Development & DBRE principles, driving automation-first practices, and applying AI/ML where applicable, you'll enhance system uptime, reduce manual toil, and enable velocity for engineering teams across PlayStation. You'll work closely with platform and product teams to ensure seamless integration and delivery of high-performance, scalable solutions across PlayStation's global ecosystem. Your contributions will directly support the reliability, scalability, and operational excellence of our data platform powering millions of players worldwide.