A leading global technology-driven firm is seeking a talented Production Engineer to join its High-Performance Computing (HPC) team based in London. This team plays a critical role in supporting cutting-edge quantitative research by designing and maintaining some of the most demanding compute and storage infrastructure in the industry.

The environment is fast-paced and intellectually rigorous, built around a culture that encourages innovation, collaboration, and continuous improvement. Engineers here are empowered to solve complex technical problems, build highly customised systems at scale, and work side by side with researchers pushing the boundaries of science and computation.

What You'll Do:

- Design and maintain high-performance computing and storage systems for large-scale research workloads
- Build tools for automating software deployment, configuration, and upgrades at scale
- Monitor and tune system, storage, and network performance across the HPC environment
- Write code to automate operational tasks and improve infrastructure reliability
- Collaborate closely with researchers to optimise their use of HPC resources

What We're Looking For:

- 5 years of experience in HPC environments, including exposure to parallel file-systems (e.g., Lustre, GPFS), batch schedulers (e.g., Slurm, Grid Engine), and high-performance networking (experience with interconnects is a plus)
- Strong Linux systems administration skills in distributed and high-scale setups
- Proficiency in at least one language (e.g., Python, Go, C) for automation and tooling
- Experience building and supporting complex, interdependent systems
- Familiarity with configuration management tools like SaltStack, Ansible, or Puppet
- Proven experience managing large-scale, distributed infrastructure and solving complex performance issues