* Collaborative Engineering: Work within a larger team to rapidly develop proof-of-concept prototypes that validate research ideas, and integrate them into production systems and infrastructure.
* Performance Analysis: Conduct in-depth profiling and tuning of operating systems and large-scale distributed systems, leveraging heterogeneous hardware (CPU, NPU).
* Documentation and Reporting: Maintain clear technical documentation of research findings, design decisions, and implementation details to ensure reproducibility and facilitate knowledge transfer within the team.
* Research & Technology Exploration: Stay current with the latest advancements in AI infrastructure, cloud-native technologies, and operating systems. Examples include techniques for efficiently executing inference workloads through SW/HW co-design, and exploiting workload characteristics to prefetch memory or minimize communication.
* Stakeholder Communication: Present project milestones, performance metrics, and key findings to internal stakeholders.
This job description is only an outline of the tasks, responsibilities and outcomes required of the role. The jobholder will carry out any other duties as may be reasonably required by their line manager. The job description and person specification may be reviewed on an ongoing basis in accordance with the changing needs of Huawei Research and Development UK Limited.
* Bachelor's or Master's degree in Computer Science or a related technical field.
* A solid background in operating systems and/or distributed systems and/or ML systems.
* Excellent programming skills, with mastery of at least one language such as C/C++.
* Good communication and teamwork skills.
* Comfortable with research methodology.
Desired:
* Familiarity with current LLM architectures (e.g. Llama 3, DeepSeek-V3).
* Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM).
* Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL).
Founded in 1987, Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We have 207,000 employees and operate in over 170 countries and regions, serving more than three billion people around the world.
Our vision and mission is to bring digital to every person, home and organization for a fully connected, intelligent world. To this end, we will drive ubiquitous connectivity and promote equal access to networks; bring cloud and artificial intelligence to all four corners of the earth to provide superior computing power where you need it, when you need it; build digital platforms to help all industries and organizations become more agile, efficient, and dynamic; and redefine user experience with AI, making it more personalized for people in all aspects of their life, whether they're at home, in the office, or on the go. This spirit of innovation has led Huawei to work in close partnership with leading academic institutions in the UK to develop and refine the latest technologies. With a shared commitment to innovation and progress, both parties have worked together to achieve common goals and establish a strong partnership. The partnership between the UK and Huawei helps to develop the technologies of the future that will transform the way we all communicate, work and live. For the past 30 years we have maintained an unwavering focus, rejecting shortcuts and easy opportunities that don't align with our core business. With a practical approach to everything we do, we concentrate our efforts and invest patiently to drive technological breakthroughs.
Huawei's vision is a fully connected, intelligent world. To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centers around the globe. In the UK, we already have design centers in Cambridge, London, Edinburgh and Ipswich. We continue to explore and define new research directions and new services.
We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward.
The Systems Infrastructure Research (SIR) Lab in Edinburgh is at the forefront of shaping the future of Huawei's data centre infrastructure. Our mission is to explore, prototype, and evaluate new technologies and designs, integrating them into Huawei's data centres to improve both internal operations and public cloud services. Our team's unique position allows us to bridge the gap between cutting-edge research and real-world engineering, turning the latest breakthroughs into production solutions. Huawei seeks a number of contractors with expertise in computer systems and AI infrastructure. We are looking to recruit people with a background in one or more of the following: operating systems, distributed systems, and machine learning systems. As a member of the SIR Lab, you will collaborate with leading researchers, tackle some of the most exciting challenges in systems and AI infrastructure, influence both academia and industry through innovative technologies, and form valuable partnerships with research teams across the globe.
* 33 days' annual leave entitlement (including UK public holidays)
* Group Personal Pension
* Life insurance
* Private medical insurance
* Medical expense claim scheme
* Employee Assistance Program
* Cycle to work scheme
* Company sports club and social events
* Corporate retail discounts
* Flexible working
* Additional time off for learning and development