A key task to understand an image and its corresponding caption is not only to find out what is shown on the picture and described in the text, but also what is the exact relationship between these two elements. The long-term objective of our work is to be able to distinguish different types of relationship, including literal vs. non-literal usages, as well as finegrained non-literal usages (i.e., symbolic vs. iconic). Here, we approach this challenging problem by answering the question: ‘How can we quantify the degrees of similarity between the literal meanings expressed within images and their captions?’. We formulate this problem as a ranking task, where links between entities and potential regions are created and ranke...
The advent of digital photography calls for effective techniques for managing growing amounts of col...
This paper studies the use of everyday words to describe images. The common saying has it that a pic...
Visual attention mechanism has been widely used by image captioning model in order to dynamically at...
A key task to understand an image and its corresponding caption is not only to find out what is sh...
We investigate the problem of understanding the message (gist) conveyed by images and their caption...
We investigate the problem of understanding the message (gist) conveyed by images and their captions...
This paper describes a set of methods to link entities across images and text. As a corpus, we used ...
The message of news articles is often supported by the pointed use of iconic images. These images t...
none2The advent of digital photography calls for effective techniques for managing growing amounts o...
In this article, we describe a system that classifies relations between entities extracted from an i...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
This paper appeared in the AAAI-98 Workshop on Representations for Multi-Modal Human-Computer Inter...
Conceptual interpretation of languages has gathered peak interest in the world of artificial intelli...
This study aims at modeling the semantic similarity between metaphor terms by means of a distributio...
This study aims at modeling the semantic similarity between metaphor terms by means of a distributio...
The advent of digital photography calls for effective techniques for managing growing amounts of col...
This paper studies the use of everyday words to describe images. The common saying has it that a pic...
Visual attention mechanism has been widely used by image captioning model in order to dynamically at...
A key task to understand an image and its corresponding caption is not only to find out what is sh...
We investigate the problem of understanding the message (gist) conveyed by images and their caption...
We investigate the problem of understanding the message (gist) conveyed by images and their captions...
This paper describes a set of methods to link entities across images and text. As a corpus, we used ...
The message of news articles is often supported by the pointed use of iconic images. These images t...
none2The advent of digital photography calls for effective techniques for managing growing amounts o...
In this article, we describe a system that classifies relations between entities extracted from an i...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
This paper appeared in the AAAI-98 Workshop on Representations for Multi-Modal Human-Computer Inter...
Conceptual interpretation of languages has gathered peak interest in the world of artificial intelli...
This study aims at modeling the semantic similarity between metaphor terms by means of a distributio...
This study aims at modeling the semantic similarity between metaphor terms by means of a distributio...
The advent of digital photography calls for effective techniques for managing growing amounts of col...
This paper studies the use of everyday words to describe images. The common saying has it that a pic...
Visual attention mechanism has been widely used by image captioning model in order to dynamically at...