Network Operations Centre Engineer

Myhcm · Cape Town, South Africa

Full-time · On-site · 2w ago

AWS · Azure · Bash · Caching · Datadog · DNS

About the role

We're part of Super Group, the NYSE-listed digital gaming company behind some of the world's leading Sports and iGaming brands. At DigiOutsource, we bring passionate people and innovative tech together to create market-leading online gaming solutions. Our multidisciplinary teams are passionate about products, customer experience and security. We're empowered to achieve the ultimate in high-performance gaming experiences using the best technology available.

Who we're looking for

We're on a thrilling journey of growth and innovation, and we need passionate, driven individuals to join us. At DigiOutsource, every day is action-packed, and we expect you to bring your A-game. In return, you'll find a supportive environment where your skills can flourish and your career can soar. Ready to become a game-changer? Supercharge your career with us and be part of something extraordinary.

Why we need you

We're building experiences that wow our customers, and that starts with bold, curious people who want to do work that matters. If you're hungry to grow, excited by impact and ready for a challenge that will supercharge your career, this could be your moment.

As a Network Operations Centre Engineer, you will play a key role in delivering a high‑quality, resilient, and proactive operations service. You will take ownership of real‑time platform monitoring, advanced incident detection, and independent triage. You will perform second‑line troubleshooting, drive effective escalation paths, coordinate with engineering teams, and contribute to rapid service restoration during live events. You will also be responsible for producing accurate operational documentation and improving incident processes. Your work directly supports platform stability and ensures seamless customer experiences, especially during peak sporting moments where reliability is critical.

Responsibilities

You'll take ownership of work that gives us our competitive edge, including:
Monitoring & Observability
Using monitoring tools such as Grafana, Datadog, SolarWinds, and Nagios to interpret dashboards, review alerts, and identify abnormal performance patterns or traffic deviations.
Correlating real‑time metrics, logs, and telemetry to detect system health concerns and escalate appropriately.

Networking & Platform Operations
Applying solid understanding of TCP/IP, DNS, HTTP, TLS, load balancing, and CDNs to support troubleshooting of platform issues.
Using working knowledge of distributed systems, caching, and messaging components to assist with fault isolation and impact assessment during incidents.

Incident Management Tooling
Using Jira for structured incident tracking, escalation, and resolution workflows.
Operating on‑call platforms such as PagerDuty and maintaining knowledge base/runbook documentation for consistent incident response.

Diagnostics & Troubleshooting
Performing first‑pass triage on server health, application performance, API latency, and database connectivity (e.g., SQL reachability, connection pooling).
Analysing logs, metrics, and system indicators to narrow down root‑cause direction during high‑pressure incidents.

Scripting & Automation
Using basic Bash, Python, or PowerShell scripts for log extraction, parsing, or system checks.
Assisting with automating recurring operational tasks to reduce manual effort and improve consistency.

Cloud & Container Technologies
Understanding cloud fundamentals in AWS, Azure, or GCP to support cloud‑based troubleshooting or triage.
Basic exposure to Docker, Kubernetes, or log stacks such as ELK/OpenSearch, Splunk, or Loki/Promtail to aid with diagnosing distributed workloads.

This list covers your core responsibilities with plenty of room to stretch, explore and take on new challenges as we grow.

Requirements

You're someone who brings:
Clear, confident communication (written and verbal), and the ability to break down complex ideas
A collaborative mindset, working smoothly with cross‑functional teams to hit shared goals
Strong organisational skills and the ability to manage multiple projects without dropping the ball
Exceptional attention to detail and a commitment to high‑quality work
Adaptability - you stay sharp, productive and positive in fast‑moving environments
A relevant IT qualification or industry certification, such as CompTIA A+ / Network+ or Cisco CCNA, or equivalent intermediate‑level technical certification.
3 - 5 years' experience in an Operations, NOC, or Incident Management environment with a focus on real‑time monitoring, incident detection, and structured escalation.
3 - 5 years' hands‑on experience using at least one major monitoring platform (e.g., Nagios, SolarWinds, Datadog, Grafana, Zabbix), including alert interpretation and basic correlation.
3 - 5 years' experience using enterprise ITSM tools such as Jira, ServiceNow, or Freshservice.
Practical expos

Benefits

Health insurance

Additional Information

Kick-start your career in the online gaming world and experience the very latest in technology and innovation.

Interested in this role?

Apply on the company's website.