Position Summary
The purpose of the role is to conduct cutting-edge mathematical and applied research on foundations and challenges of Multimodal Artificial Intelligence (AI) to develop state-of-the-art solutions for real-world large-scale problems at Samsung Research and Development Institute, UK (SRUK). This is a good opportunity to work on next-generation mobile devices from Samsung. The research output will be disseminated in local and international research communities such as in research seminars, and top-tier conferences and journals.
As a forward-thinking company, we are at the forefront of innovation, and seek an individual with a passion for pushing the boundaries of Discriminative and Generative Multimodal AI processing text, image and video on PC and mobile environment. The prospective engineer will have an opportunity to work in production environments and solve industry-relevant problems.
Role and Responsibilities
1. Cutting-edge research to develop state-of-the-art solutions to existing problems and/or propose novel research challenges considering real-world case studies in Multimodal AI.
2. Development of high quality code with detailed documentation to support reproducible research in local and international research communities in Multimodal AI.
3. Publication in top-tier conferences and journals, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, AAAI, ACL, STOC, IEEE TPAMI, IEEE TNNLS, IEEE TSP, IEEE TIP, Information Fusion, JAIR, IJRR, IJCV, and JMLR.
Skills and Qualifications
Essential:
4. Currently studying for a PhD in Computer Science, Engineering, Mathematics or a related discipline (or just completing).
5. Efficiency in elementary topics in mathematics (calculus, probability, statistics, linear algebra and optimization) and computer science (algorithms, data structures, parallel/distributed computing).
6. Experience in one or more general purpose programming languages including Python, Java, C and C++.
7. At least one first-author publication in one of the following conferences and journals: NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, AAAI, EMNLP, ACL, IEEE TPAMI, IEEE TNNLS, IJCV, and JMLR.
8. Experience and demonstrated output on Multimodal Foundation Models research involving Visual Language Models, Large Language Models, Vision Transformers, or Diffusion Models.
Additional:
9. Experience in software libraries and toolboxes such as PyTorch, Tensorflow, SciKits, Kaldi, OpenCV and CUDA.
10. Experience in software engineering and development.
11. Relevant work experience as an intern or a researcher in an R&D Lab. in academy or industry.
12. Ability to design and execute a research agenda.
Contract Type: 6 Month Contract (Inside IR35)
Job Location: Staines-upon-Thames, Surrey, UK
Hybrid Working: 3 days onsite and 2 days working from home weekly.
*