Multimodal AI Intern (Audio:Visual AI) : PhD Level
Salaries : Competitive
Staines:upon:Thames
Join a leading AI research team driving innovation in next:generation mobile technologies. Were looking for a PhD:level intern or recent graduate to work on cutting:edge audio:visual AI solutions that will shape the future of smart, on:device intelligence.
Youll collaborate with world:class researchers and engineers, helping turn novel machine learning concepts into production:ready software for intelligent mobile platforms.
What You'll Do:
:Develop and prototype innovative solutions in multimodal on:device AI (audio + video).
:Research and implement methods such as contrastive learning, model compression, or multimodal LLMs.
:Tackle real:world challenges with efficient, scalable code using PyTorch or TensorFlow.
:Work within a high:impact team and contribute to research publications and internal reports.
What We're Looking For:
:PhD student or recent graduate in ML/AI, Computer Science, Engineering, or a related field.
:First:author publications in top AI/ML venues (CVPR, NeurIPS, ICML, ICLR, etc.).
:Strong skills in Python and/or C/C++, and hands:on experience with modern ML frameworks.
:Familiarity with Git and sound software engineering practices.
:Excellent communication and problem:solving abilities.
Bonus Points For:
:Experience in emotion recognition, foundational face models, or deception detection.
:Knowledge of multi:task learning, embedded AI, or distributed ML systems.
:Contributions to open:source ML libraries.
:Expertise in AI pipeline optimization and profiling.
If interested Apply