Replicating the human ability to connect Vision and Language has recently attracted considerable attention in the Computer Vision and Natural Language Processing communities. This research effort has resulted in algorithms that can retrieve images from textual descriptions and vice versa, provided that the images are realistic, the sentences have simple semantics, and paired training data is available. In this paper, we go beyond these limitations and tackle the design of visual-semantic algorithms in the domain of the Digital Humanities. This setting not only exhibits more complex visual and semantic structures but also suffers from a significant lack of training data, which makes fully supervised approaches infeasible. With this ...
154 pages. Over the course of the last decades, we have witnessed the significant progress of machine ...
Automated computer vision methods and tools offer new ways of analysing audio-visual material in the...
Powered by deep convolutional networks and large scale visual datasets, modern computer vision syste...
While several approaches to bring vision and language together are emerging, none of them has yet ad...
The applicability of computer vision to real paintings and artworks has been rarely investigated, ev...
It was a dream to make computers intelligent. Like humans who are capable of understanding informati...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
This paper reports steps in probing the artistic methods of figurative painters through computationa...
Visual-semantic embeddings have been extensively used as a powerful model for cross-modal retrieval ...
As vision and language techniques are widely applied to realistic images, there is a growing interes...
A tremendous number of techniques have been proposed to transfer artistic styl...
Semantic embeddings have advanced the state of the art for countless natural language processing tas...
We present a model that generates natural language descriptions of images and their regions. Our ap...
In this paper, the authors introduce PicNet, a Web-based system for augmenting semantic resources wi...