Independently administer and maintain Linux-based servers, storage systems, and workstations supporting research, teaching, and administrative workloads.
Serve as a maintainer of the DSI HPC cluster, including compute nodes, scheduling infrastructure, storage, and supporting services.
Design, implement, and maintain configuration management and automation using tools such as Ansible and/or Puppet.
Plan and execute system upgrades, migrations, and large-scale infrastructure projects with minimal guidance.
Administer and troubleshoot storage systems (such as ZFS) and parallel or networked filesystems such as pNFS and Lustre.
Monitor system health, performance, and capacity; analyze logs and metrics to proactively identify and remediate issues.
Manage core infrastructure services including DNS, DHCP, TCP/IP networking, authentication, and access control.
Support and troubleshoot networking infrastructure, including switches and related hardware.
Administer Linux operating systems across multiple distributions, including Ubuntu and Red Hat-based systems.
Support endpoint management solutions such as SimpleMDM and manage macOS, windows and Linux workstation environments.
Provide technical leadership and mentoring to junior staff and assist with operational best practices.
Support and maintain audiovisual systems, including Zoom Rooms, displays, digital signage, and assembly or conference room technology.
Collaborate with faculty, researchers, students, and staff to gather requirements and deliver technical solutions that support research workflows.
Participate in incident response, disaster recovery planning, backups, and security remediation activities.
Maintain documentation for systems, procedures, and infrastructure.
Successfully collaborate and communicate with other system administrative team members and non-technical staff
Utilize virtualization and containerization technologies such as Docker, Nomad or Podman.
Configures, installs, upgrades, and maintains server applications and hardware. Works to safeguard the integrity of computer software. Implements operating system enhancements to improve the reliability and performance of the system.
Administers operating systems, maintains security, and implements backup procedures for the organization's information systems and peripheral equipment, such as servers, desktops, printers, and storage devices.
Perform other related duties as needed.
Requirements
Education:
Minimum requirements include a college or university degree in related field.
Work Experience:
Minimum requirements include knowledge and skills developed through 5-7 years of work experience in a related job discipline.
Certifications:
---
Bachelor's degree in computer science or a related field.
Significant hands-on experience administering Linux systems in production environments.
Deep knowledge of command line tools and systems.
Experience operating or supporting HPC or related systems, including compute r clusters, schedulers, and research workflows.
Strong experience with configuration management (Ansible, Puppet, or similar tools).
Proficiency in Python and Bash scripting for automation and systems tooling.
Experience with distributed and high-performance storage
Benefits
Health insuranceVision insurance
Additional Information
Department
PSD Data Science: Systems Administration
About the Department
The Data Science Institute (DSI) executes the University of Chicago's bold, innovative vision of Data Science as a new discipline. The DSI seeds research on the interdisciplinary frontiers of this emerging field, forms partnerships with industry, government, and social impact organizations, and supports holistic data science and AI education. The staff in the Data Science Institute support the University's students and scholars, their shared ideals, and the core values that make the University a singular intellectual destination.
Job Summary
The Data Science Institute (DSI) Tech staff is responsible for the design, operation, and security of all computing infrastructure within the Data Science Institute, including research computing systems, workstations, servers, storage, networking, and audiovisual environments. The Systems Administrator 3 is a mid-level systems professional who operates with minimal supervision and serves as a core technical contributor to the Institute's IT operations.
This role is responsible for independently administering, maintaining, and improving DSI's full IT environment, with a strong focus on high-performance computing (HPC) infrastructure. The Systems Administrator 3 collaborates closely with senior technical staff, faculty, researchers, and external partners, and is expected to take ownership of complex systems, lead projects, and contribute to architectural decisions.
The Systems Administrator 3 plays a key role in ensuring system reliability, performance, security, and scalability across the Institute's computing resources.