Job description Site Name: Cambridge MA, UK - London Posted Date: Oct 25 2024 The Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways. Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward: Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”. Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent. Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time. Our AIML & Scientific Computing Optimization team is focused on optimizing first-in-class Compute and AIML platforms that accelerate application development, scale up computational experiments, and integrate all computation with project metadata, logs, experiment configuration and performance tracking over abstractions that encompass Cloud and High-Performance Computing. This metadata-forward, CI/CD-driven platform represents and enables the entire application and analysis lifecycle including interactive development and explorations (notebooks), large-scale batch processing, observability and production application deployments. The optimization team’s focus is on maximizing scale and performance of all aspects of the platforms. A Director of AIML & Scientific Computing Optimization is a deeply technical leader. They consistently deliver major compute and AIML platform features and solutions with cross-organizational impact and value. They are recognized as expert in software engineering, scientific computing, and/or AIML with deep understanding of performance and optimization, within the Onyx team, across R&D Digital & Tech, and even externally. They can work closely with and have strong technical knowledge of – underlying platform dependencies such as DevOps, Infrastructure and Cloud and can enable collaborations and help drive the requirements across other Onyx engineering teams that results in improved performance and better user experience. This role is responsible for building and leading a team of world-class software engineers focused on optimizing a best-in-class Compute and AIML Platforms at scale, quality, and cost. The Director of AIML & Scientific Computing Optimization will support the Sr. Director of Computing, Analysis, and AI/ML Platforms in building a strong culture of accountability and ownership in their team, as well as instilling best-in-class engineering practices (e.g. testing, code reviews, DevOps-forward ways of working). They work in close partnership with AI/ML Platform team, Compute Platform team, Data and Infrastructure Engineering, Product Management, Portfolio Management, and other engineering functions to ensure close alignment with customers and with engineering teams both upstream and downstream of their work. Key Responsibilities: Build, lead, develop, and retain world-class software engineers Serve as a top expert for the optimization team, and contribute technical expertise to teams in closely aligned technical areas such as DevOps, Cloud and Infrastructure Lead design of major optimization software components of the Compute and AIML Platforms, contribute to development of production code and participate in both design reviews and PR reviews Accountable for delivery of scalable solutions to the Compute and AIML Platforms that supports the entire application lifecycle (interactive development and explorations/analysis, scalable batch processing, application deployment) with particular focus on performance at scale Partner with both AIML and Compute platform teams as well as scientific users to help optimize and scale scientific workflows by utilizing deep understanding of both software as well as underlying infrastructure (networking, storage, GPU architectures, …) Direct scrum team leads, and contribute technical expertise to teams in closely aligned technical areas Able to design innovative strategy and way of working to create a better environment for the end users, and able to construct a coordinated, stepwise plan to bring others along with the change curve Standard bearer for proper ways of working and engineering discipline, including CI/CD best practices and proactively spearhead improvement within their engineering area Serve as a technical thought leader and champion: e.g., speak at industry events, promote GSK as an attractive place to build a career and thrive as a Platform engineer, act as a key knowledge holder for the Onyx organization. Why You? Basic Qualifications: Bachelor’s, Master’s or PhD degree in Computer Science, Software Engineering, or related discipline. 8 years of experience using specialized knowledge in cloud computing, scalable parallel computing paradigms, software engineering, CI/CD with Bachelor’s. 6 years of experience using specialized knowledge in cloud computing, scalable parallel computing paradigms, software engineering, CI/CD with Master’s. 4 years of experience using specialized knowledge in cloud computing, scalable parallel computing paradigms, software engineering, CI/CD with a PhD. At least 2 years of experience with recruiting, managing, and developing engineers or other deeply technical contributors Preferred Qualifications: Deep experience using at least one interpreted and one compiled common industry programming language: e.g., Python, C/C++, Scala, Java, including toolchains for documentation, testing, and operations / observability Hands-on experience with application performance tuning and optimization, including in parallel and distributed computing paradigms and communication libraries such as MPI, OpenMP, Gloo, including deep understanding of the underlying systems (hardware, networks, storage) and their impact on application performance Expert understanding of AIML training optimization, including distributed multi-node training best practices and associated tools and libraries as well as hands-on practical experience in accelerating training jobs Understanding of ML model deployment strategies, including agent systems as well as scalable LLM model inference systems deployed in multi-GPU, multi-node environments. Deep expertise in modern software development tools / ways of working (e.g. git/GitHub, DevOps tools, metrics / monitoring, …) Cloud experience (e.g., AWS, Google Cloud, Azure), including infrastructure-as-code and relevant tools and libraries such as Terraform, Ansible, and Packer Experience with CI/CD implementations using git and a common CI/CD stack (e.g., Azure DevOps, CloudBuild, Jenkins, CircleCI, GitLab) Experience with Docker, Kubernetes, and the larger CNCF ecosystem including experience with application deployment tools such as Helm Experience with low level application builds tools (make, CMake) and understanding of optimization at the build and compile level Demonstrated excellence with agile software development environments using tools like Jira and Confluence Deep familiarity with the tools, techniques, optimizations in high-performance applications space, including engagement with the open-source community (and potentially making contributions to such tools) Experience with establishing software engineering ways of working and best practices for a team (whether informally or as formal SOPs etc) Experience recruiting top engineering talent Experience with agile planning and execution processes for software delivery GSKOnyx LI-GSK GSKTech1 Please visit GSK US Benefits Summary t o learn more about the comprehensive benefits program GSK offers US employees. Why GSK? Uniting science, technology and talent to get ahead of disease together. GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/ immunology and oncology). Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves – feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together. If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or 1 801 567 5155 (outside US). GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class. Important notice to Employment businesses/ Agencies GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site. Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting For the Record site.