
Senior Site Reliability Engineer - Observability Engineer - NordVPN

Nord Security · Vilnius, Lithuania
Full-time · Remote · 2w ago
Epic · Grafana · Linux · Observability · Prometheus · Python



Benefits

Innovate with industry leaders: Work alongside global experts to build world-leading cybersecurity tools, impacting millions of users around the world.

Learn & grow: Boost your skills via our extensive training programs (online and offline) and other resources. Benefit from mentorship and career-switch opportunities to grow within the company.

Work in a next-gen Cyber City office: Thrive in our bustling office, featuring ergonomic workspaces, modern meeting rooms, engaging events, and specialty coffee to fuel your day.

Hybrid work: Enjoy the flexibility of 3 office days, working from home for the remaining 2.

Work from anywhere: Recharge with a change of scenery - choose to work from any location when you feel the need to power your creativity and drive.

Physical well-being: Boost your health with free-of-charge 24/7 gym access, onsite and online workouts, and consultations led by in-house Physical Well-Being experts.

Mental & emotional health: Nurture your mind with free psychologist consultations, dedicated mental health events, and premium access to top-rated wellness apps like Calm, Headspace, and Mindletic.

Premium healthcare: Receive private health insurance, giving you peace of mind for your health needs.

Extra days off: Enjoy additional vacation days off as you grow with us. Plus, get extra days for sick leave, special occasions, or parenting needs.

Joyful moments - special treats: Celebrate life's big moments with special gifts from us on your birthday, anniversary, and other major events, such as weddings or the arrival of a new family member.

Company events & team-building: Experience iconic Nord Security celebrations, team-buildings, and knowledge-sharing events, nurturing bonds that fuel our success.

Workation: Embark on a legendary company getaway abroad, filled with exciting activities, live concerts, engaging workshops, and epic time together.

Kindly refer to our Privacy Notice for Recruitment Candidates for comprehensive information regarding our data handling procedures throughout recruitment processes.

Health insurance · Paid time off · Performance bonus

Additional Information

The world's most advanced VPN, and a whole lot more. If you're a curious problem-solver who carves their own path, join the team behind Threat Protection Pro, the NordLynx protocol, and the fastest VPN on the planet - tools that put privacy, security, and control back in people's hands. Your impact? Helping millions take back control of their online security, privacy, and data.

NordVPN runs a global edge infrastructure serving millions of users. Knowing what's happening across that infrastructure - in real time, at scale, without drowning in noise - is what this role exists to solve.

We're looking for a Senior Site Reliability Engineer (SRE) focused on observability: designing monitoring systems, improving signal quality, reducing alert fatigue, and collaborating with data teams on anomaly detection. You'll own how we understand the health and behavior of our distributed systems.

Main responsibilities

Design, build, and improve monitoring pipelines and observability tooling across globally distributed infrastructure
Define and implement service-level monitoring based on golden signals (latency, traffic, errors, saturation)
Reduce alert fatigue - build meaningful, actionable alerts that engineers trust
Develop and maintain custom exporters, scripts, and integrations for metrics and log collection
Collaborate with the data team on anomaly detection and data-driven operational insights
Understand service signals - know what to measure, why, and what the numbers actually mean

Core requirements

Distributed systems observability - monitoring architecture, signal design, dashboarding
Golden signal thinking - you design monitoring around what matters, not what's easy to measure
Alert design - reducing noise, building actionable alerts, managing on-call sanity
Python - scripting, custom exporters, automation, data processing
Linux administration and debugging
Networking fundamentals

Bonus Points For

SaltStack
Advanced networking - traffic analysis, protocol-level debugging
Advanced data knowledge - aggregation strategies, downsampling, cardinality management, retention trade-offs
Proven track record of onboarding new systems/services into monitoring from scratch
Familiarity with agentic engineering - Claude Code, LLM integrations, MCP workflows

Tools You Will Use

Naemon (Nagios) and Gearmand
Prometheus-based exporters
Telegraf
Fluent Bit
VictoriaMetrics ecosystem
OpenSearch
Grafana



Interested in this role?

Apply on the company's website.