High Performance Computing Engineer

External

Harvarduniversity · Boston, MA

Full-timeOn-site1mo ago

AnsibleComplianceDockerDocumentationLinux

Cover Letter Connect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role

Requirements

Minimum of two years' post-secondary education or relevant work experience.
Additional Qualifications and Skills:
Bachelor's degree preferred.
Experience managing Linux-based systems in a research or academic environment.
Familiarity with workload schedulers (Slurm preferred), cluster provisioning, or performance tuning.
Experience with infrastructure monitoring, configuration management (e.g., Ansible), and containerization (e.g., Apptainer/Singularity, Docker).
Understanding of security and compliance frameworks relevant to research computing.
Strong troubleshooting, communication, and collaboration skills.
Ability to work in a team-oriented environment and adapt to evolving priorities.
Demonstrated service orientation and commitment to operational reliability.
Willingness to learn and grow technical depth in HPC tools and methodologies.
Effective time management and documentation habits.
Certificates and Licenses:
Completion of Harvard IT Academy specified foundational courses (or external equivalent) preferred.
Standard Hours/Schedule: 35 hours per week
Visa Sponsorship Information: Harvard University is unable to provide visa sponsorship for this position.
Pre-Employment Screening: Identity, Criminal
Staying Informed About Your Application: Due to the high volume of applications, we may not always be able to reach out right away, but you can track your status anytime through the Careers@Harvard portal.
#LI-DK1
Work Format Details
Salary Grade and Ranges
This position is salary grade level 057. Please visit Harvard's Salary Ranges to view the corresponding salary range and related information.

Benefits

Harvard offers a comprehensive benefits package that is designed to support a healthy work-life balance and your physical, mental and financial wellbeing. Because here, you are what matters. Our benefits include, but are not limited to:Generous paid time off including parental leaveMedical, dental, and vision health insurance coverage starting on day oneRetirement plans with university contributionsWellbeing and mental health resourcesSupport for families and caregiversProfessional development opportunities including tuition assistance and reimbursementCommuter benefits, discounts and campus perksLearn more about these and additional benefits on our Benefits & Wellbeing Page .EEO/Non-Discrimination Commitment StatementHarvard University is committed to equal opportunity and non-discrimination . We seek talent from all parts of society and the world, and we strive to ensure everyone at Harvard thrives. Our differences help our community advance Harvard's acadHealth insuranceDental insuranceVision insuranceParental leave

Additional Information

As a High-Performance Computing (HPC) Engineer, you will support the implementation, operation, and lifecycle management of secure and scalable HPC environments that enable computational research across HMS. Working as part of the Research Computing Infrastructure team, you will contribute to the provisioning and administration of compute clusters, workload scheduling systems such as Slurm, user-facing software environments, and secure platforms that meet institutional compliance needs. This role emphasizes hands-on technical execution, operational reliability, and collaboration with colleagues and researchers to support evolving HPC workflows and infrastructure. Core Duties: Perform provisioning, configuration, and decommissioning of HPC compute clusters. Support the administration and tuning of workload schedulers (e.g., Slurm) to ensure efficient job management and cluster utilization. Help maintain secure, regulated compute environments (e.g., NIST 800-171). Contribute to the integration of user accounts and identity management with institutional systems. Maintain and optimize user-facing software environments, including module systems and containerized applications. Support development and maintenance of scripts, automation, and tools used in cluster operations. Monitor system health, respond to alerts, and assist with compliance reporting and documentation. Collaborate with team members and researchers to troubleshoot and improve the computing environment. Contribute to operational documentation and support knowledge-sharing across the team. Participate in off-hours on-call rotation. Perform other duties as assigned.

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Harvarduniversity? Share your experience

Interested in this role?

Apply on the company's website.

Cover Letter Connect