UK [United Kingdom] - London City - London
Job Type: Permanent
Job Description: The AI Theory group at the London Research Center is seeking a Research Scientist to contribute to the development of the theoretical and algorithmic foundation for AIGC. The team is tackling ambitious projects focusing on AIGC theory and multimodal text-and-image/video generation. The ideal candidate should have experience in LLM and Multimodal-LLM, Generative Modelling (Diffusion Models, Auto-regressive Models, Transformers), and/or Efficient Learning (Data Efficiency, Optimization Theory).
The research scientist will conduct both academic and applied research. They will aim to advance the state-of-the-art in multimodal modelling with top-tier conference publications and will collaboratively develop advanced products and services with other groups in the company.
Key Responsibilities:
1. Leading and participating in cutting-edge research projects, focusing on the theoretical and algorithmic foundation of AIGC.
2. Devising, prototyping and patenting innovative solutions in collaboration with product teams.
3. External engagement: drafting academic publications, giving talks and collaborating with academic leaders.
4. Mentoring research interns and junior researchers.
This job description is only an outline of the tasks, responsibilities and outcomes required of the role. The jobholder will carry out any other duties as may be reasonably required by his/her line manager. The job description and personal specification may be reviewed on an ongoing basis in accordance with the changing needs of Huawei Research and Development UK Limited.
Person Specification:
Required:
1. PhD degree in computer science or related field; or equivalent research experience.
2. Hands-on experience with image/video generation and/or (multimodal) large language models.
3. Strong research track record. E.g., have published in top-tier conferences including NeurIPS, ICLR, ICML, CVPR, ICCV; and journals including JMLR and TPAMI, etc.
4. Strong coding skills: ability to quickly prototype in Python.
5. Excellent written and oral communication skills.
Desired:
1. Passionate about (multimodal) large language models, text-to-image/video model, controllable generation.
2. Experience mentoring other junior researchers.
3. Experience working in industry and collaborating with product groups.
4. Experience with PyTorch Profiler, Accelerate/DeepSpeed/Megatron/HuggingFace PEFT.
5. Hands-on experience with ViTs, (Multimodal)-LLMs, Diffusions, (Video) VAEs, etc.
#J-18808-Ljbffr