Skip to main content
Back to jobs

Senior Software Engineer, Devops/SRE

External
roku logoRoku · Bengaluru, India
Full-timeOn-site2w ago
AnsibleAWSAzureCapacity PlanningCloudFormationCompliance
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

The Platform Infrastructure team ensures that all Roku systems run smoothly. These systems support over 100M+ users and billions in transaction revenue per year. We are a group of highly skilled infrastructure and software engineers that help build and operate systems at internet scale, including Platform (Kubernetes, Istio, Envoy, operators, and more) and Observability (OSS/CNCF-supported observability projects). We engage with multiple teams to achieve company-impacting results. We are seeking a talented and experienced DevOps/SRE (Site Reliability Engineering) Senior Software Engineer to join our dynamic team. The ideal candidate will have a strong background in DevOps practices, cloud infrastructure management, automation, and team leadership skills. If you have a consistent track record of architecting and building large-scale systems; enjoy solving intriguing system challenges at internet-scale; if you are innovative at heart; and have a great balance of skills in learning, organizing, building, and enjoy making an impact, this role might be a great fit for you!

Responsibilities

  • Oversee the design, implementation, and maintenance of scalable and resilient cloud infrastructure on platforms spanning AWS and GCP. Ensure high availability, reliability, and performance of critical systems
  • Collaborate with your peers to be responsible for the entire software lifecycle, seek the right problem to solve, and strive for excellence
  • Manage individual project priorities, deadlines, and deliverables related to your technical expertise and assigned domains
  • Lead incident response efforts, working closely with cross-functional teams to resolve issues quickly and minimize downtime. Implement effective incident management processes and post-incident reviews
  • Collaborate with security teams to ensure the integrity and security of infrastructure and applications. Implement security best practices and compliance standards
  • Identify performance bottlenecks and optimize system resources for maximum efficiency. Conduct regular performance tuning and capacity planning exercises
  • Drive continuous improvement initiatives within the team and across the organization. Proactively identify areas for enhancement and implement solutions to address them
  • Maintain comprehensive documentation of systems, processes, and procedures. Foster a culture of knowledge sharing and contribute to the collective learning of the team
  • Participate in 24x7 on-call rotation, and be available to work with global teams in the event of critical outages
  • We're excited if you have
  • 12+ years of experience in DevOps/SRE roles
  • Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming languages
  • Experience with a number of the following: ECS, Docker, Kubernetes, Envoy, Istio, Linkerd, Solo
  • Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation
  • Strong understanding of distributed systems, microservices architecture, and cloud-native technologies
  • The drive and self-motivation to understand the intricate details of a complex infrastructure environment
  • Strong proficiency in cloud platforms such as AWS, Azure, or GCP
  • Solid understanding of networking, security, and compliance principles
  • Proven track record of driving results and delivering high-quality solutions in a fast-paced environment
  • Demonstrated ability to communicate clearly with both technical and non-technical project stakeholders, with the ability to work effectively in a cross-functional team environment
  • Certifications in relevant technologies such as Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or Certified Information Systems Security Professional (CISSP) are preferred
  • BS Degree in Computer Science or Equivalent
  • #LI-SK8
  • Our Hybrid Work Approach
  • Roku fosters an inclusive and collaborative environment where teams work in the office Monday through Thursday. Fridays are flexible for remote work except for employees whose roles are required to be in the office five days a week or employees who are in offices with a five day in office

Benefits

Vision insuranceRemote work optionsFlexible schedule

Additional Information

Teamwork makes the stream work. Roku is changing how the world watches TV Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers. From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at roku? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect