Site Reliability Engineer

External
Spectris · India
Full-time · Remote · Today
Agile · AWS · Azure · Bash · CI/CD · Compliance

About the role

This job will provide you with an opportunity to further your career alongside some of the best and most passionate technology experts from around the world in a leading company within the test, measurement and data analytics industry. You will be a strong contributor collaborating closely with colleagues from various business functions.

At HBK, we live up to our three values: Be True, Own It and Aim High. We believe in absolute integrity - it's how we win for stakeholders, the environment and each other. We believe in teamwork and keeping our promises - to ourselves and others. Finally, we believe in being bold and positive. This is how we perform at our best and achieve greater success.

Ideal Candidate:

  • Proficiency in scripting and automation using tools such as Bash, Python & Go
  • Expertise with CI/CD tools (e.g., GitHub Actions, TeamCity, Jenkins)
  • Knowledge of infrastructure-as-code and control plane technologies such as Terraform, Pulumi, and Crossplane (including composition-based abstractions)
  • Expertise with containerization technologies (Docker, Kubernetes)
  • Experience with cloud platforms (AWS, Azure, GCP)
  • Experience leveraging GenAI tools (e.g., GitHub Copilot, ChatGPT) to accelerate development and automation workflows
  • Strong knowledge of SRE and DevOps principles, practices, and methodologies
  • Experience in monitoring and observability tools (e.g., ELK, Grafana - Prometheus, Tempo, Loki)
  • Experience building platform services using Python (APIs, CLI tools, or developer portals)
  • Experience with Internal Developer Platforms (IDP) or self-service infrastructure platforms
  • Understanding of platform engineering and developer experience (DevEx) principles

Nice to have:

  • Experience with DevSecOps, Threat Modelling
  • Familiarity with incident response and post-incident analysis processes
  • Strong troubleshooting and problem-solving skills
  • Ability to work independently while also being a team player
  • Experience working in an agile environment
  • Actively propagate the SRE mindset by fostering a culture of reliability, automation, collaboration, and continuous improvement

Responsibilities

  • Design, build, and operate the internal developer platform (IDP), including portal and CLI interfaces backed by Python-based APIs, enabling consistent, self-service infrastructure provisioning via Crossplane abstractions
  • Develop and maintain cloud governance policies, procedures and standards
  • Offer guidance and recommendations on cloud governance best practices, with a focus on enhancing security and compliance measures
  • Monitor cloud spend; collate, analyze, and prioritize cloud optimization recommendations; and calculate potential savings
  • Collaborate with development teams to design, build, and maintain reliable and scalable systems
  • Participate in incident response processes and support production systems by triaging alerts and resolving operational issues
  • Define and improve service reliability metrics (SLIs/SLOs), including availability calculations using observability data (e.g., logs/metrics)
  • Implement and improve monitoring, alerting, and incident response processes to ensure system health and availability
  • Continuously identify and eliminate toil through automation (including use of GenAI where applicable)
  • Contribute to the development and improvement of CI/CD pipelines, deployment processes, and release strategies
  • Continuously improve the reliability of our systems through post-incident reviews and root cause analysis
  • Implement, operate, and maintain an Information Security Management System (ISMS) compliant with the ISO 27001 standard
  • Stay current with industry trends, best practices, and emerging technologies related to Site Reliability Engineering

One company - HBK

Why Explore a Career at HBK

  • We offer competitive salaries as we recognize that our dedicated team members make us successful
  • Work From Anywhere (WFA): We value work-life balance and offer flexible working hours and a full-time WFA option
  • We provide attractive Health Insurance and Vacation benefits for our employees
  • You will be part of a team that is hi

Benefits

Health insurance · Vision insurance · Paid time off · Flexible schedule

Additional Information

Site Reliability Engineer (SRE) at HBK


Interested in this role?

Apply on the company's website.
