In this thesis we introduce conditional neural language models based on log-bilinear and recurrent neural networks with applications to multimodal learning and natural language understanding. We first introduce an LSTM encoder for learning visual-semantic embeddings that rank the relevance of text to images in a joint embedding space. Next we introduce three log-bilinear models for generating image descriptions that integrate both additive and multiplicative interactions. Beyond image conditioning, we describe a multiplicative conditional neural language model for learning distributed representations of attributes and metadata. Our model allows for contextual word-relatedness comparisons through decompositions of a word embedding tensor. ...
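As a concrete illustration of the joint embedding space mentioned above, the following is a minimal sketch (in PyTorch) of an LSTM sentence encoder paired with a margin-based pairwise ranking loss. It is not the thesis's implementation: the module names, dimensions, margin value, and the assumption that image features arrive as precomputed, L2-normalized CNN vectors are all illustrative choices.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SentenceEncoder(nn.Module):
    """Map batches of token-id sequences to unit-norm vectors in the joint space."""

    def __init__(self, vocab_size: int, embed_dim: int = 300, joint_dim: int = 1024):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, joint_dim, batch_first=True)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        # tokens: (batch, seq_len) integer ids; use the final hidden state as the sentence code.
        _, (h_n, _) = self.lstm(self.embed(tokens))
        return F.normalize(h_n[-1], dim=-1)


def pairwise_ranking_loss(img: torch.Tensor, txt: torch.Tensor, margin: float = 0.2) -> torch.Tensor:
    """Hinge loss that ranks matched image-text pairs above mismatched ones.

    img, txt: (batch, joint_dim) L2-normalized embeddings; row i of one matches row i of the other.
    """
    scores = img @ txt.t()                  # cosine similarities for all pairs in the batch
    positives = scores.diag().unsqueeze(1)  # similarity of each matched pair
    # Contrast each image against every mismatched sentence, and vice versa.
    cost_s = (margin + scores - positives).clamp(min=0)
    cost_i = (margin + scores - positives.t()).clamp(min=0)
    mask = torch.eye(scores.size(0), dtype=torch.bool, device=scores.device)
    return cost_s.masked_fill(mask, 0).sum() + cost_i.masked_fill(mask, 0).sum()


# Example usage with random stand-ins for CNN image features and tokenized captions.
encoder = SentenceEncoder(vocab_size=10000)
captions = torch.randint(0, 10000, (4, 12))
images = F.normalize(torch.randn(4, 1024), dim=-1)
loss = pairwise_ranking_loss(images, encoder(captions))
```

In this kind of setup, matched image-sentence pairs are pushed to score higher (by cosine similarity) than mismatched pairs within a minibatch by at least the margin, which is what makes the shared space usable for ranking text against images.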
This open access book provides an overview of the recent advances in representation learning theory,...
In the field of natural language processing (NLP), recent research has shown that deep neural networ...
The current generation of neural network-based natural language processing models excels at learning...
We introduce two multimodal neural language models: models of natural language that can be conditio...
Recurrent neural networks (RNN) have gained a reputation for producing state-of-the-art results on m...
Inspired by recent advances in multimodal learning and machine translation, we introduce an encoder-...
Convolutional Neural Networks (CNNs) have been shown to yield very strong results in several Computer Vis...
We presented a learning model that generated natural language descriptions of images. The model utili...
Recurrent neural networks (RNNs) are exceptionally good models of distributions over natural languag...
Recently a variety of LSTM-based conditional language models (LM) have been applied across a range o...
This thesis focuses on proposing and addressing various tasks in the field of vision and language, a...
We present novel methods for analyzing the activation patterns of recurrent neural networks from a l...
This thesis introduces the concept of an encoder-decoder neural network and develops architectures f...
We introduce (1) a novel neural network structure for bilingual modeling of sentence pairs that allo...
Language modeling is a crucial component in a wide range of applications including speech recognitio...