Development of interactive multimodal models for language learning in visual environments

Strub, Florian

Publication date

January 2020

Publisher

HAL CCSD

Abstract

While our representation of the world is shaped by our perceptions, our languages, and our interactions, these have traditionally been studied as distinct fields in machine learning. Fortunately, this partitioning started to open up with the recent advent of deep learning methods, which standardized raw feature extraction across communities. However, multimodal neural architectures are still in their infancy, and deep reinforcement learning is often limited to constrained environments. Yet, we ideally aim to develop large-scale multimodal and interactive models that correctly apprehend the complexity of the world. As a first milestone, this thesis focuses on visually grounded language learning for three reasons: (i) they are both well-...
