Data Scientist
Large Language Model Development
Location: Newcastle Upon Tyne
What you can expect
Due to our continued growth in the field of Integrity Analytics, our Integrity Solutions Business requires a Data Scientist to work with the existing data science and engineering teams on internal R&D initiatives as well as a variety of client projects. ROSEN has developed a powerful new capability, the Integrity Data Warehouse (IDW), based on predictive analytics and a comprehensive database of corrosion features from pipelines around the world. The IDW holds data from over 26,000 in-line inspections, totalling over 1,000,000 km of pipeline. This wealth of data gives ROSEN an unparalleled, comprehensive picture of the condition of pipeline assets around the world, across all diameters, pressures and fluids. In addition, ROSEN is seeking to codify its understanding of pipeline integrity using fine-tuned and prompt-engineered Large Language Models, and has already developed significant capability in their use and deployment.
This role is focussed on the development of digital training and certification solutions using Large Language Models, specifically through Retrieval-Augmented Generation (RAG) and similar methods. Prior experience in these areas is beneficial but not required. The successful candidate will be based in Newcastle, UK and will work collaboratively as part of a highly capable data science team.
Responsibilities:
* Develop a generalizable framework for querying training material, documentation and the IDW using agentic LLMs, RAG and fine-tuning of open-source models
* Automate and validate the results of RAG queries to the IDW
* Implement APIs to deploy RAG functionality within the business, using frameworks such as FastAPI and Ollama servers
* Present results and their impact to internal and external stakeholders
* Produce documentation and tests to ensure the maintainability of developed solutions
What you will bring
* A degree-level qualification, completed or in progress, in a quantitative discipline such as Data Science, Computer Science or Mathematics
* Experience with programming languages such as Python, including the use of Git source control
* Familiarity with SQL for querying databases
* Knowledge of, or experience in, Large Language Models desirable
* Excellent communication skills and adaptability
* Ability to establish a good working relationship with other professionals, including with international colleagues
What we offer
* The role is full time and permanent
* Competitive salary and benefits, including 10% pension contribution, Vitality Health Insurance and bonus scheme
* Ongoing training & development opportunities
* Must be willing and able to travel nationally and internationally when required
* An excellent level of spoken and written English is essential for this position
Who we are
The ROSEN Group is a leading global provider of cutting-edge solutions in all areas of the integrity process chain. Since its beginnings as a one-man business in 1981, ROSEN has grown rapidly and is today a technology group that operates in more than 110 countries with over 4,000 highly qualified employees.
* Inspection of critical industrial assets to ensure reliable operations of the highest standards and effectiveness
* Customized engineering consultancy providing efficient asset integrity management
* Production and supply of customized novel products and systems
* Market-driven, topical state-of-the-art research and development providing “added-value” products and services
For more information about the ROSEN Group, go to www.rosen-group.com.
Do you have any questions?
Matt Attridge
Recruitment