Role Overview: Cranleigh STEM has partnered with an exciting biotech start-up who are recruiting for a highly skilled Data Engineer (Cloud Infrastructure) based in Cambridge. In this role, you will play a crucial part in streamlining data flow across the organization by integrating data between teams. You will be responsible for overseeing data flow management and maintaining cloud infrastructure to support our sequencing projects and downstream data analysis. The successful candidate will manage the cloud infrastructure that supports these sequencing projects, enabling both clinical analytics and BI reporting. Main Duties & Responsibilities: Manage and optimize cloud resources to handle large-scale sequencing data Implement infrastructure improvements to enhance usability, performance, and security Automate data collection processes from laboratory instruments Use ETL processes to centralize organizational data Provide ad-hoc engineering support to laboratory and clinical teams Skills & Qualifications: Bachelor’s degree in a relevant technical field or equivalent experience Proficiency in Python, R, or other programming languages for data processing Experience managing and configuring cloud infrastructure and resources Knowledge of cloud security best practices Experience integrating APIs for process automation Familiarity with containerization tools (Docker, Singularity) Experience with schedulers such as AWS Batch, GCP Batch, or Slurm Proficiency in Git and version control Ability to critically assess data-handling practices in a commercial R&D setting Understanding of data management best practices Desirable Skills: Associate-level cloud certification Knowledge of SQL and relational databases Understanding of UK GDPR requirements for processing human genomics data