Our mission at Xyme is to solve important societal problems by revolutionizing the practice of synthetic chemistry through what we call xymes - AI-generated enzymes that can catalyze any reaction. As an innovative startup based in Oxford and Manchester, UK, we bring together interdisciplinary teams of scientists, engineers, and data specialists to push the boundaries of enzyme design. Our dynamic and collaborative work environment is fuelled by a passion for innovation. We cultivate a culture of continuous learning and improvement, where every team member can make a lasting impact on our groundbreaking research and real-world applications.
As a Senior Data Engineer specializing in Lab Informatics at Xyme, you will build and optimise the data infrastructure that powers our laboratory operations. You'll develop and maintain the digital foundations of our lab informatics systems, empowering our scientists to seamlessly capture, manage, and leverage experimental data in their work designing and creating novel enzymes. You'll play a pivotal role in accelerating our design-learning cycles and scientific discoveries by integrating laboratory results into our AI design platform. Working from the ground up, you'll shape our laboratory data systems, implementing solutions that directly advance our mission of revolutionising synthetic chemistry.
What you will do
You will work alongside our laboratory scientists to translate data requirements into technical solutions. Your key responsibilities are to:
* Design, develop, and maintain robust data models, workflows and automations for our laboratory information management systems, including entity and sample registration, management, tracking and result ingestion.
* Create and optimise ELT pipelines that load laboratory data into our centralized data warehouse and transform it for business intelligence and machine learning applications.
* Build and maintain automated systems for managing ontologies, controlled vocabularies, and data structures to ensure data quality and generate metadata for machine learning.
* Ensure data quality, security, and governance across all laboratory informatics data assets.
* Develop and maintain analytical applications, tools, and dashboards to support experimental scientists in their work and decision-making.
* Optimise our end-to-end DMTL (Design-Make-Test-Learn) pipeline by identifying bottlenecks and building applications and tools to reduce the time between design and learning phases.
Who we are looking for
* You are a seasoned data engineer with 3–5 or more years of experience in data modelling and in building ETL/ELT pipelines, data integrations, and tools (Python, SQL).
* You have strong proficiency with cloud databases, data warehouses, orchestration tools, and analytics platforms (Postgres, Snowflake, dbt, Prefect/Dagster/ArgoWF, Streamlit/Superset).
* You have proven experience integrating and customizing LIMS platforms and scientific data management systems, as well as building custom Python applications and tools to streamline laboratory workflows and data handling.
* You have a proven track record of collaborating with laboratory scientists and data managers - understanding their requirements, translating needs into technical solutions, and providing ongoing support for data and analytical workflows.
* You are familiar with containerization and deployment tools, such as Docker, ECS/Fargate, Kubernetes, CI/CD pipelines, and GitHub Actions.
* You are familiar with cloud platforms, particularly AWS services including Lambda, ECS, SNS, SQS, EventBridge, and API Gateway.
What would be nice to have
* Experience working with biological and chemical data.
* Academic background in biology, chemistry, bioinformatics, or related fields that would strengthen collaboration with laboratory scientists and data managers.
* Experience in data science or machine learning to understand and support ML pipeline requirements.
* Experience working in a fast-paced startup environment.
What you will get from us
* An environment where you will have significant impact - your work will be critical in delivering a groundbreaking platform with the potential to solve some of the biggest challenges of our time: energy, climate, and health.
* Flexible hybrid work arrangement with 2-3 days per week at our Oxford office, complemented by remote working options that accommodate your preferences.
* Comprehensive learning and development opportunities to help you advance your skills, including mentorship programs, access to industry conferences, specialised training sessions, and hands-on experience with cutting-edge technologies.