USACares Jobs

Job Information

Michigan State University(MSU) HPCC System Administrator | Information Technologist II in East Lansing, Michigan

h3Working/Functional Title/h3 HPCC System Administrator h3Position Summary/h3 Michigan State Universityand#8217;s Institute for Cyber-Enabled Research is seeking an experienced HPC or Linux system architect or administrator (Information Technologist II) who will work with the ICER team to develop, support and optimize ICERand#8217;s High Performance Computing (HPC) systems using some of the technologies and skills described in the Desired Qualifications section. ICERand#8217;s HPC systems consist of over 1000 nodes, 50,000 cores, 400 GPUs, and 9 PB of storage and supports researchers in many different scientific domains across the campus and the state. Duties include: liparticipate in the design, development, operation, and support of High Performance Computing systems on campus;/li lidesign and support modern development and operations process and methodologies for researchers and internal ICER needs,/li liwork with other units on campus to create innovative solutions for University researchers,/li limanage complex projects that serve the entire university community as well as MSUand#8217;s external partners,/li liconsult with users to ensure the best use of the Universityand#8217;s computational resources,/li liparticipate in regional and national cyber infrastructure communities,/li liand other duties as assigned./li /ul This position includes an emergency on-call support rotation (compensated) with the rest of the systems team. h3Unit Specific Education/Experience/Skills/h3 Knowledge equivalent to that which normally would be acquired by completing a four-year college degree in Computer Science, Information Systems, or a related information technology field and three to five years of related and progressively more responsible or expansive work experience in Linux-based research computing or enterprise environments; or an equivalent combination of education and experience. h3Desired Qualifications/h3 Six years of experience with progressively increasing responsibility in research computing environments or complex Linux environments. Experience with HPC-specific technologies is preferred, but not required. Experience with technologies including: liRed Hat Enterprise Linux, CentOS, or derivatives and Linux services and technologies like dnsmasq, systemd, LDAP, PAM, sssd, OpenSSH, cgroups./li liParallel, network attached storage (Lustre, NFS, Spectrum Scale, Ceph)/li liScripting languages (including Bash, Python, or Perl)/li liVirtualization (ESXi or KVM/libvirt), containerization (Docker or Singularity), configuration management and automation (tools like xCAT, Puppet, kickstart) and orchestration (Kubernetes, docker-compose, CloudFormation, Terraform.)/li liHigh performance networking technologies (Ethernet and Infiniband) and hardware (Mellanox and Juniper),/li liLog aggregation and monitoring (Elastic Stack [ELK], Prometheus)/li liWeb development (Django, Sinatra, React, or Ruby)/li liBatch job queueing and scheduling systems (Slurm),/li liVersion control (git) and Continuous Integration (CI) or Continuous Deployment (CD)/li liCloud technologies (AWS S3, EC2, Lambda, ParallelCluster, Fargate; or Azure or GCP equivalents)/li liBuilding, troubleshooting, and supporting scientific software./li /ul The ideal client would have a history of demonstrated problem solving ability, teamwork, and adaptability to change; a commitment to mission-focused innovation; and continuous learning for self-improvement. h3Equal Employment Opportunity Statement/h3 All qualified applicants will receive consideration for e