Multimodal AI Intern

Apply

Location

UK - West London

Workplace

On-Site

Job Type

Permanent

Job Description

Multimodal AI Intern (Audio-Visual AI) – PhD Level

Salaries – Competitive

Staines-upon-Thames

Join a leading AI research team driving innovation in next-generation mobile technologies. We're looking for a PhD-level intern or recent graduate to work on cutting-edge audio-visual AI solutions that will shape the future of smart, on-device intelligence.

You'll collaborate with world-class researchers and engineers, helping turn novel machine learning concepts into production-ready software for intelligent mobile platforms.

What You’ll Do:

Develop and prototype innovative solutions in multimodal on-device AI (audio + video).
Research and implement methods such as contrastive learning, model compression, or multimodal LLMs.
Tackle real-world challenges with efficient, scalable code using PyTorch or TensorFlow.
Work within a high-impact team and contribute to research publications and internal reports.

What We’re Looking For:

PhD student or recent graduate in ML/AI, Computer Science, Engineering, or a related field.
First-author publications in top AI/ML venues (CVPR, NeurIPS, ICML, ICLR, etc.).
Strong skills in Python and/or C/C++, and hands-on experience with modern ML frameworks.
Familiarity with Git and sound software engineering practices.
Excellent communication and problem-solving abilities.

Bonus Points For:

Experience in emotion recognition, foundational face models, or deception detection.
Knowledge of multi-task learning, embedded AI, or distributed ML systems.
Contributions to open-source ML libraries.
Expertise in AI pipeline optimization and profiling.

If interested please email Trei@microtech-global.com

Job Reference

19003UCE

Categorisation

Software & Artificial Intelligence:

A.I. / Machine Learning

Contact Details

T: +44 1628 206 213

E: tech@microTECH-global.com

Apply

Multimodal AI Intern

Search Jobs

Options

Apply