Learning image representations has been an interesting and challenging problem. When users upload images to photo sharing websites, they often provide multiple textual tags for ease of reference. These tags can reveal significant information about the content of the image such as the objects present in the image or the action that is taking place. Approaches have been proposed to extract additional information from these tags in order to augment the visual cues and build a multi-modal image representation. However, the existing approaches do not pay much attention to the semantic meaning of the tags while they encode. In this work, we attempt to enrich the image representation with the tag encodings that leverage their semantics. Our approa...
A long standing goal of artificial intelligence is to enable machines to perceive the visual world a...
Multi-modal distributional models learn grounded representations for improved performance in semanti...
In the recent years, the emergence of deep learning models has greatly advanced computer vision and ...
Learning image representations has been an interesting and challenging problem. When users upload im...
Images without annotations are ubiquitous on the Internet, and recommending tags for them has become...
International audienceWith the availability of massive amounts of digital images in personal and on-...
International audienceWith the availability of massive amounts of digital images in personal and on-...
We construct multi-modal concept repre-sentations by concatenating a skip-gram linguistic representa...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
This paper studies the use of everyday words to describe images. The common saying has it that a pic...
© 1992-2012 IEEE. Automatically understanding and discriminating different users' liking for an imag...
© Springer International Publishing Switzerland 2015. Automatically understanding and modeling a use...
A long standing goal of artificial intelligence is to enable machines to perceive the visual world a...
Multi-modal distributional models learn grounded representations for improved performance in semanti...
In the recent years, the emergence of deep learning models has greatly advanced computer vision and ...
Learning image representations has been an interesting and challenging problem. When users upload im...
Images without annotations are ubiquitous on the Internet, and recommending tags for them has become...
International audienceWith the availability of massive amounts of digital images in personal and on-...
International audienceWith the availability of massive amounts of digital images in personal and on-...
We construct multi-modal concept repre-sentations by concatenating a skip-gram linguistic representa...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
International audienceThis paper tackles two recent promising issues in the field of computer vision...
This paper studies the use of everyday words to describe images. The common saying has it that a pic...
© 1992-2012 IEEE. Automatically understanding and discriminating different users' liking for an imag...
© Springer International Publishing Switzerland 2015. Automatically understanding and modeling a use...
A long standing goal of artificial intelligence is to enable machines to perceive the visual world a...
Multi-modal distributional models learn grounded representations for improved performance in semanti...
In the recent years, the emergence of deep learning models has greatly advanced computer vision and ...