Recent years have seen an explosion in multimodal data on the web. It is therefore important to perform multimodal learning to understand the web. However, it is challenging to join various modalities because each modality has a different representation and correlational structure. In addition, various modalities generally carry different kinds of information that may provide enrich understanding; for example, the visual signal of a flower may provide happiness; however, its scent might not be pleasant. Multimodal information may be useful to make an informed decision. Therefore, we focus on improving representations from individual modalities to enhance multimodal representation and learning. In this doctoral thesis, we presented technique...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Representation Learning is a significant and challenging task in multimodal learning. Effective moda...
Conference of 2016 ACM Workshop on Vision and Language Integration Meets Multimedia Fusion, Iv and L...
Abstract. The growth of multimedia content on the web raise diverse challenges. Over the last decade...
Creating a meaningful representation by fusing single modalities (e.g., text, images, or audio) is t...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Multimodal machine learning (MML) is a tempting multidisciplinary research area where heterogeneous ...
<p> Deep learning is skilled at learning representation from raw data, which are embedded in the se...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
The growth of content on the web has raised various challenges, yet also provided numerous opportuni...
2019-01-29Multimodal reasoning focuses on learning the correlation between different modalities pres...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Representation Learning is a significant and challenging task in multimodal learning. Effective moda...
Conference of 2016 ACM Workshop on Vision and Language Integration Meets Multimedia Fusion, Iv and L...
Abstract. The growth of multimedia content on the web raise diverse challenges. Over the last decade...
Creating a meaningful representation by fusing single modalities (e.g., text, images, or audio) is t...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Most machine learning applications involve a domain shift between data on which a model has initiall...
Multimodal machine learning (MML) is a tempting multidisciplinary research area where heterogeneous ...
<p> Deep learning is skilled at learning representation from raw data, which are embedded in the se...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
The growth of content on the web has raised various challenges, yet also provided numerous opportuni...
2019-01-29Multimodal reasoning focuses on learning the correlation between different modalities pres...
With the abundance of multimedia in web databases and the increasing user need for content of many m...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Representation Learning is a significant and challenging task in multimodal learning. Effective moda...
Conference of 2016 ACM Workshop on Vision and Language Integration Meets Multimedia Fusion, Iv and L...