Multimodal datasets contain an enormous amount of relational information, which grows exponentially with the introduction of new modalities. Learning representations in such a scenario is inherently complex due to the presence of multiple heterogeneous information channels. These channels can encode both (a) inter-relations between the items of different modalities and (b) intra-relations between the items of the same modality. Encoding multimedia items into a continuous low-dimensional semantic space such that both types of relations are captured and preserved is extremely challenging, especially if the goal is a unified end-to-end learning framework. The two key challenges that need to be addressed are: 1) the framework must be able to me...
Data often consists of multiple diverse modalities. For example, images are tagged with textual info...
Recent years have witnessed a rise in real-world data captured with rich structural information that...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...
This thesis investigates the potential of hypergraphs for capturing higher-order relations between o...
International audienceContinuous multimodal representations suitable for multimodal information retr...
<p> Deep learning is skilled at learning representation from raw data, which are embedded in the se...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Learning joint embedding space for various modalities is of vital importance for multimodal fusion. ...
In this paper, we present a hypergraph neural networks (HGNN) framework for data representation lear...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Multi-modal distributional models learn grounded representations for improved performance in semanti...
Most of the networks we encounter in practice are Heterogeneous Information Networks (HINs), where i...
Representation learning aims to encode the relationships of research objects into low-dimensional, c...
Utilizing multimodal features to describe multimedia data is a natural way for accurate pattern reco...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Data often consists of multiple diverse modalities. For example, images are tagged with textual info...
Recent years have witnessed a rise in real-world data captured with rich structural information that...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...
This thesis investigates the potential of hypergraphs for capturing higher-order relations between o...
International audienceContinuous multimodal representations suitable for multimodal information retr...
<p> Deep learning is skilled at learning representation from raw data, which are embedded in the se...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Learning joint embedding space for various modalities is of vital importance for multimodal fusion. ...
In this paper, we present a hypergraph neural networks (HGNN) framework for data representation lear...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Multi-modal distributional models learn grounded representations for improved performance in semanti...
Most of the networks we encounter in practice are Heterogeneous Information Networks (HINs), where i...
Representation learning aims to encode the relationships of research objects into low-dimensional, c...
Utilizing multimodal features to describe multimedia data is a natural way for accurate pattern reco...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Data often consists of multiple diverse modalities. For example, images are tagged with textual info...
Recent years have witnessed a rise in real-world data captured with rich structural information that...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...