Skip to main content
Back to jobs

HPC Systems Administrator

External
helsing logoHelsing · Munich, Germany
Full-timeOn-site1w ago
AnsibleBashComplianceDocumentationGenerative AIIncident Response
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and control their ethical standards. As democracies, we believe we have a special responsibility to be thoughtful about the development and deployment of powerful technologies like AI. We take this responsibility seriously. We are an ambitious and committed team of engineers, AI specialists and customer-facing programme managers. We are looking for mission-driven people to join our European teams - and apply their skills to solve the most complex and impactful problems. We embrace an open and transparent culture that welcomes healthy debates on the use of technology in defence, its benefits, and its ethical implications. Helsing operates on-premises high-performance computing (HPC) infrastructure that supports electromagnetics, computational fluid dynamics, and multi-physics simulation. As an HPC Systems Administrator based in Munich, you will take ownership of this critical environment, ensuring that our team of simulation engineers remains unblocked, productive, and equipped to solve complex problems. You will play a vital role in maintaining rigorous technical standards, optimising compute resources, and scaling our infrastructure to support continuous, large-scale modelling. The role is based on-site in Munich with regular travel to our Tussenhausen site. The day-to-day Own the day-to-day administration of compute nodes, workload schedulers, parallel storage, high-speed interconnects, and licence servers Ensure the environment remains highly available and consistently performant through proactive monitoring, patching, firmware updates, and incident response Administer the workload scheduler (Slurm, PBS Pro, or similar), managing queues, fair-share policies, accounting, and quotas to optimise resource utilisation Manage the simulation software stack and user environments using tools such as Lmod, Spack, or EasyBuild Collaborate with hardware and software vendors to resolve support cases, process RMAs, and ensure upgrade quality Automate operational workflows using Bash, Python, and Ansible to improve system efficiency and reduce manual intervention Maintain the strict security posture required for cleared work and support ongoing compliance reviews Onboard users and maintain comprehensive documentation to empower engineers to self-serve You should apply if you have administered Linux systems within a production HPC or large shared compute environment have hands-on experience managing workload schedulers such as Slurm, PBS Pro, LSF, or similar possess production experience with parallel filesystems (Lustre, BeeGFS, or GPFS) and high-speed interconnects (InfiniBand or RoCE) are capable of scripting and automating complex workflows with Bash, Python, and Ansible can effectively manage commercial simulation software (CAE, CFD, or EM) and licence servers, including FlexLM Note: We operate in an industry where women, as well as other minority groups, are systematically under-represented. We encourage you to apply even if you don't meet all the listed qualifications; ability and impact cannot be summarised in a few bullet points.

Requirements

  • Experience administering Altair or Siemens simulation suites
  • A background working within classified or strictly regulated environments
  • Expertise in GPU computing, including CUDA and NVIDIA toolchains, alongside MPI stack management
  • Familiarity with HPC containers using Apptainer, Enroot, or Pyxis
  • Experience with identity management systems such as Keycloak, FreeIPA, Active Directory, or Kerberos
  • Competence in infrastructure-as-code practices using tools such as Terraform
  • Join Helsing and work with world-leading experts in their fields
  • Helsing's work is important. You'll be directly contributing to the protection of democratic countries while balancing both ethical and geopolitical concerns
  • The work is unique. We operate in a domain that has highly unusual technical requirements and constraints, and where robustness, safety, and ethical considerations are vital. You will face unique Engineering and AI challenges that make a meaningful impact in the world
  • In our domain, success is a matter of order-of-magnitude improvements and novel capabilities. This means we take bets, aim high, and focus on big opportunities. Despite being a relatively young company, Helsing has already been selected for multiple significant government contracts
  • We actively encourage

Benefits

Health insurance

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at helsing? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect