Data Engineer Junior and Senior Roles available About Groundtruth AI Groundtruth AI is a newly founded and growing start-up in 2024. We are a Google Cloud partner working to help major financial institutions transform the way they find and fight financial crime. Our founders have worked with Google for years and were key figures in shaping and building Google’s latest Cloud product targeting Anti-Money Laundering. We exist to deploy technologies that make a measurable difference in tackling financial crime. The billions of dollars stolen and laundered each year mask untold human suffering which we can help prevent. Who are we looking for? We are looking for data engineers to build and deploy repeatable data and machine learning pipelines, webapps and systems around AI products to our rapidly expanding portfolio of customers and into their GCP infrastructure. You’ll be involved and interested in the end-to-end delivery of the system. Exploring, understanding and processing data, designing and building pipelines, understanding model outputs and evaluating performance against defined objectives, and communicating these results to the rest of the team. A proactive and driven approach to problem solving and solutionizing is key. As one of our first employees, you will have a key role in shaping our direction and decision making. We are looking for people to help us rapidly scale and for future leaders as we expand and recruit further. You will have a demonstrable track record of getting things done in environments where the objectives are sometimes ambiguous. You will be comfortable with working with novel technologies and techniques as you go along, and owning a problem from end to end. Our culture We are a young, small company with an engineering led approach and a focus on delivering high quality software. We value attitude, collaboration, respect for others and taking proactive actions alongside engineering expertise. You don’t need to be an expert in AI or financial crime to succeed, but you do need the intellectual curiosity to learn more. Experience We’re recruiting candidates with a range of experience levels, the below is the minimum for what we would expect for a candidate. More senior candidates may have far more experience. We value experience but accept proven ability for junior positions where appropriate. 3 years experience / proven ability of: Delivering software into production environments with an emphasis on data processing or MLOps. Working as part of a development team with version control technologies Experience developing data transformations on large scale data platforms, either relational or non-relational. Ad-hoc data analysis and data exploration Experience debugging data processes, resolving and articulating problems with data and performance optimization. Solving and implementing practical strategies for system and architecture design. (Preferred) Client facing software delivery experience Tech stack We work with a wide range of technologies and have no expectation for you to have experience of all of them. Proficiency with at least one programming language - SQL, python, rust, typescript Proficiency with data manipulation languages - SQL, python Data platform and pipeline technologies - bigquery, apache ibis, sqlglot, DBT. Cloud - Google Cloud Platform, AML AI Data pipeline and management - bigquery Version control/CI/CD - git, github actions Webapps - react, next.js Responsibilities: Own the deployment of models and solutions on client environments. Drive the development of robust, repeatable and deployable data and MLOps pipelines to tune, train and predict. Creatively adapt to a range of different client technologies. Communicate progress, outcomes and blockers clearly Work with the co-founders, prioritise and implement additional data and features to improve our success metrics. Education Quantitative ability, either through a formal education in a quantitative subject or equivalent experience Knowledge of designing and evaluating models and systems, either through education or equivalent experience. Language English fluency essential Spanish desirable Benefits Hybrid Working - 1/2 days in the office in London. £55k-£105k (depending on seniority) Pension contributions - 3% match Bonus up to 15% of base salary, dependent on company performance 25 days holiday Read more about us at www.groundtruthai.net