Associate Site Reliability Engineering - OpenShift (Brno, Czech Republic)
Additional Information
The Red Hat IT OpenShift team is looking for an Associate Site Reliability Engineer (SRE) to design, develop, scale, and operate our Red Hat Hybrid OpenShift Platforms (on-prem & cloud). As a Site Reliability Engineer, you will contribute to running Red Hat OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating toil through automation. In the IT OpenShift team, you will have the opportunity to influence the complex challenges of scale that are unique to Red Hat IT managed platform services, while using your skills in coding, operations, and large-scale distributed system design.

We develop, deploy, and maintain Red Hat's next-generation mission-critical platform across hybrid cloud infrastructures. We are a global team operating on-premises and in the public cloud, using the latest technologies from Red Hat and beyond. Red Hat relies on teamwork and openness for its success. We learn from our failures in a blameless environment to support the continuous improvement of the team. At Red Hat, your individual contributions have more visibility than at most large companies, and visibility means career opportunities and growth.

Successful applicants must reside in a state where Red Hat is registered to do business.

What Will You Do?
- Design, build, and manage our large-scale infrastructure and platform services, including public cloud, private cloud, and datacenter-based
- Automate cloud infrastructure through the use of technologies (e.g., auto scaling, load balancing), scripting (Python and Go), and monitoring and alerting solutions (e.g., Splunk, Splunk IM, Prometheus, Grafana, Catchpoint, Datadog)
- Design, develop, and become an expert in IT's Red Hat OpenShift offerings by leveraging emerging industry standards
- Build and support standardized CI/CD platform components using OpenShift Pipelines, Tekton, and GitLab to enable multiple application deployments
- Apply Infrastructure as Code methodologies using GitOps practices with ArgoCD for declarative platform management
- Break down complex engineering efforts into consumable chunks while working with teams to understand deliverables
- Design and develop software such as Kubernetes operators, webhooks, and CLI tools
- Implement and maintain intelligent infrastructure and application monitoring designed to enable application engineering teams
- Ensure the production environment is operating in accordance with established procedures and best practices
- Escalate to senior engineers or team leads for support during high-severity and critical platform-impacting events
- Provide feedback on bugs and feature improvements to the various Red Hat Product Engineering teams
- Design software tests and perform peer reviews to increase the quality of our codebase
- Help develop peers' capabilities through knowledge sharing and collaboration
- Participate in a regular on-call schedule, supporting the operational needs of our tenants
- Drive sustainable incident response and contribute to blameless postmortems
- Work within a small agile team to develop and improve SRE methodologies, support your peers, plan, and self-improve

What Will You Bring?
- 2+ years of experience operating production services on Kubernetes / OpenShift
- 2+ years of programming experience in Python, Go
- 1+ years of experience using cloud providers and technologies (Google, Azure, Amazon, etc.)
- Solid understanding of Linux systems administration (RHEL/Fedora preferred)
- Understanding of standard networking (TCP/IP, DNS, HTTP/TLS) and authentication protocols (LDAP)
- Comfort with incident response and on-call responsibilities
- Ability to work in a team with minimal supervision while keeping the team informed

The Following Are Considered a Plus
- Contributions to open source projects
- Experience with the Operator SDK or building Kubernetes operators
- Experience with GitOps workflows for managing infrastructure or application configuration
- Knowledge of SRE principles: SLOs, error budgets, toil measurement

About Red Hat
Red Hat is the world's leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat
Red Hat's culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come toge