Site Reliability Engineer - Senior - NordVPN Apps
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Benefits
Additional Information
The world's most advanced VPN, and a whole lot more. If you're a curious problem-solver who carves their own path, join the team behind Threat Protection Pro, the NordLynx protocol, and the fastest VPN on the planet-tools that put privacy, security, and control back in people's hands. Your impact? Helping millions take back control of their online security, privacy, and data. Main Responsibilities Together with a small, senior-leaning SRE team, run the Infrastructure & Services layer that our internal data organization, Data Analysts, Scientists, and Engineers, relies on day to day. Design, build, maintain, and document internal platforms with a heavy focus on automation, reliability, and scalability. Own the stability, availability, and security of our services (Minio, Spark, Kafka, OpenSearch, Tableau, and more). Engineer systems to be resilient by design, we don't run on-call rotations, we build solutions robust enough that they aren't needed. Support the services that Product teams depend on, such as OpenSearch for real-time analytics and Tableau for longer-term reporting. Work directly with Analysts, Scientists, and Engineers to debug their queries, code, and workflows. Drive cost optimization and capacity planning, scaling our 100+ bare-metal servers and PiB-scale storage to match demand. Diagnose hardware faults (failing disks, memory, etc.) and dispatch remote hands for the physical fixes, detection and decision-making are on us. Champion automation and engineering best practices across the department through ongoing R&D. Help shape the structure, tooling, and future direction of the team. Core Requirements 5+ years in Systems Engineering / SRE / DevOps, with demonstrated ownership of system design and architecture. 5+ years working with Linux/Debian in production. Python, Bash, and Git proficiency. Solid hands-on experience with containerization, everything we run is containerized. Production experience managing servers and services with Ansible. Experience with Grafana and observability tooling. Experience building and operating GitLab CI/CD pipelines. Comfortable debugging across the stack and across team boundaries, including other people's code and data workflows. A practical understanding of server hardware and how it fails, even without physical access. Excellent written and spoken English. A proactive, detail-oriented mindset and a genuine appetite for hard problems. Bonus Points For Experience operating any of our core services (Minio, Spark, Kafka, OpenSearch, Tableau). Mentoring or technical-leadership experience. Having a HomeLab. Language & Tools You Will Use Python Bash Git Docker Compose Ansible GitLab CI/CD Grafana OpenSearch, Kafka, Minio AI assistants
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at nord-security? Share your experience