Are you passionate about high-performance computing (HPC) and ready to shape the future of cloud GPU infrastructure? A global leader in providing on-demand and reserved cloud GPUs for AI, machine learning, and HPC workloads urgently needs a Contractor HPC Support Engineer.
If you thrive on cutting-edge technologies and scalable systems, this role is ideal for you.
We are looking for candidates with HPC Support experience, focused on HPC system administration.
You should have experience with:
1. Network technologies such as InfiniBand and RoCE
2. Storage technologies such as VAST or WEKA
3. Job schedulers such as Slurm or Kubernetes
4. Hypervisors such as Proxmox or VMware
Knowledge of HPC-AI cluster support is essential. Some exposure to hardware support is a plus, but it is not a focus of this role. The primary task is to access the environment remotely, check the monitoring dashboards, and review logs and messages to identify issues. You must be accustomed to tight SLAs and to supporting large clusters of hundreds of servers.
This role is 100% remote in the UK, working Pacific Time (PST) hours, Monday to Friday, for 3 months, outside IR35.
To learn more about this opportunity, please call Giles Murphy at Hays as soon as possible.