This dissertation delves into the use of textual metadata for image understanding. We seek to exploit this additional textual information as weak supervision to improve the learning of recognition models. There is a recent and growing interest for methods that exploit such data because they can potentially alleviate the need for manual annotation, which is a costly and time-consuming process. We focus on two types of visual data with associated textual information. First, we exploit news images that come with descriptive captions to address several face related tasks, including face verification, which is the task of deciding whether two images depict the same individual, and face naming, the problem of associating faces in a data set to th...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Recent technological advances in the acquisition of multimedia data have led to an exponential growt...
The rapid growth of Internet and multimedia information has shown a need in the development of multi...
This dissertation delves into the use of textual metadata for image understanding. We seek to exploi...
We are currently experiencing an exceptional growth of visual data, for example, millions of photos ...
International audienceIn this paper, we present methods for face recognition using a collection of i...
As larger multimodal datasets are becoming available on the web, the possibility for better, more hu...
Visual recognition is a fundamental research topic in computer vision. This dissertation explores d...
This thesis explores how machine learning can be applied to the task of learning to recognise visual...
University of Technology Sydney. Faculty of Engineering and Information Technology.Nowadays, images ...
Tremendous amounts of visual data are produced every day, such as user-generated images and videos f...
This dissertation addresses the problem of describing images using visual attributes and textual tag...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
While text generated by current vision-language models may be accurate and syntactically correct, it...
National audienceAnnotating images using a fixed number of concepts is a fundamental task for conten...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Recent technological advances in the acquisition of multimedia data have led to an exponential growt...
The rapid growth of Internet and multimedia information has shown a need in the development of multi...
This dissertation delves into the use of textual metadata for image understanding. We seek to exploi...
We are currently experiencing an exceptional growth of visual data, for example, millions of photos ...
International audienceIn this paper, we present methods for face recognition using a collection of i...
As larger multimodal datasets are becoming available on the web, the possibility for better, more hu...
Visual recognition is a fundamental research topic in computer vision. This dissertation explores d...
This thesis explores how machine learning can be applied to the task of learning to recognise visual...
University of Technology Sydney. Faculty of Engineering and Information Technology.Nowadays, images ...
Tremendous amounts of visual data are produced every day, such as user-generated images and videos f...
This dissertation addresses the problem of describing images using visual attributes and textual tag...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
While text generated by current vision-language models may be accurate and syntactically correct, it...
National audienceAnnotating images using a fixed number of concepts is a fundamental task for conten...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Recent technological advances in the acquisition of multimedia data have led to an exponential growt...
The rapid growth of Internet and multimedia information has shown a need in the development of multi...