High Performance Computing Analyst required to help ensure HPC facilities meet availability, performance, and usability requirements.
The successful candidate will have a university degree and experience in:
1. Parallel processing computer systems including large-scale Linux clusters.
2. Unix or Linux systems administration, Unix shells, Python or Perl, and configuration management.
3. C, Fortran, MPI, and/or OpenMP programming.
High-Performance Computing facility running at least a hundred thousand jobs a day.
Main Duties and Key Responsibilities
1. Facilitate efficient use of HPC facilities support groups, developers, and end users with assistance, tools, and training.
2. Resolve user and operational problems with operating systems and HPC software stack.
3. Configure, test, tune, and go live of new HPC hardware.
4. Install, maintain, configure, and tune the operating system, high-performance interconnects, parallel filesystems, batch scheduling systems, standard utilities, user environment, and locally developed tools on the HPC facilities.
5. Continuously improve resiliency.
6. Provide on-site 24x7 monitoring staff with information, procedures, and training that they need.
7. Implement security for HPC systems.
#J-18808-Ljbffr