We build a joint multimodal model of text and images for automatically assigning illustrative images to journalistic articles. We approach the task as an unsupervised representation learning problem of finding a common representation that abstracts from individual modalities, inspired by multimodal Deep Boltzmann Machine of Srivastava and Salakhutdinov. We use state-of-the-art image content classification features obtained from the Convolutional Neural Network of Krizhevsky et al. as input "images" and entire documents instead of keywords as input texts. A deep learning and experiment management library Safire has been developed. We have not been able to create a successful retrieval system because of difficulties with training neural netwo...
This thesis is a compilation of five research contributions whose goal is to do unsupervised and tra...
Image classification is often solved as a machine-learning problem, where a classifier is first lear...
Automatic caption generation of images has gained significant interest. It gives rise to a lot of in...
We build a joint multimodal model of text and images for automatically assigning illustrative images...
Data often consists of multiple diverse modalities. For example, images are tagged with textual info...
A Deep Boltzmann Machine is described for learning a generative model of data that consists of multi...
Multimodal machine learning is a subfield of machine learning that aims to relate data from differen...
Matching two texts is a fundamental problem in many natural language processing tasks. An effective ...
Master's thesis in Computer scienceThe thesis is dedicated to the background linking tasks for news ...
Multimodal learning has received a lot of attention in the recent years. Associating a description ...
Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal wit...
Abstract Multimodal learning has been an important and challenging problem for decades, which aims ...
© 2019 Association for Computing Machinery. Performing direct matching among different modalities (l...
We propose an automated image selection system to assist photo editors in selecting suitable images ...
We study a novel multimodal-learning problem, which we call text matching: given an image containing...
This thesis is a compilation of five research contributions whose goal is to do unsupervised and tra...
Image classification is often solved as a machine-learning problem, where a classifier is first lear...
Automatic caption generation of images has gained significant interest. It gives rise to a lot of in...
We build a joint multimodal model of text and images for automatically assigning illustrative images...
Data often consists of multiple diverse modalities. For example, images are tagged with textual info...
A Deep Boltzmann Machine is described for learning a generative model of data that consists of multi...
Multimodal machine learning is a subfield of machine learning that aims to relate data from differen...
Matching two texts is a fundamental problem in many natural language processing tasks. An effective ...
Master's thesis in Computer scienceThe thesis is dedicated to the background linking tasks for news ...
Multimodal learning has received a lot of attention in the recent years. Associating a description ...
Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal wit...
Abstract Multimodal learning has been an important and challenging problem for decades, which aims ...
© 2019 Association for Computing Machinery. Performing direct matching among different modalities (l...
We propose an automated image selection system to assist photo editors in selecting suitable images ...
We study a novel multimodal-learning problem, which we call text matching: given an image containing...
This thesis is a compilation of five research contributions whose goal is to do unsupervised and tra...
Image classification is often solved as a machine-learning problem, where a classifier is first lear...
Automatic caption generation of images has gained significant interest. It gives rise to a lot of in...