Abstract—In this paper, we address the problem of retrieving objects based on open-vocabulary natural language queries: Given a phrase describing a specific object, e.g., “the corn flakes box”, the task is to find the best match in a set of images containing candidate objects. When naming objects, humans tend to use natural language with rich semantics, including basic-level categories, fine-grained categories, and instance-level concepts such as brand names. Existing approaches to large-scale object recognition fail in this scenario, as they expect queries that map directly to a fixed set of pre-trained visual categories, e.g. ImageNet synset tags. We address this limitation by introducing a novel object retrieval method. Given a candidate...
In this paper, we describe a simple approach to learning models of visual object categories from ima...
We describe a model of object recognition as machine translation. In this model, recognition is a pr...
With the growth of open sensor networks, multiple applications in different domains make use of a la...
Existing object detection methods are bounded in a fixed-set vocabulary by costly labeled data. When...
Open-vocabulary object detection, which is concerned with the problem of detecting novel objects gui...
The task of open-vocabulary object-centric image retrieval involves the retrieval of images containi...
This thesis presents a novel paradigm known as the 'Pictorial Dictionary' for the content-based imag...
We propose an unsupervised method that, given a word, automatically selects non-abstract senses of t...
This thesis focuses on efficient and effective object retrieval from an unlabelled collection of ima...
We propose an approach to object recognition using vocabulary tree which, instead of finding the clo...
We describe building a large-scale image ontology using the WordNet lexical resources. This ontology...
This paper presents a novel search paradigm that uses mul-tiple images as input to perform semantic ...
Current approaches to object category recognition require datasets of training images to be manuall...
In the field of visual scene understanding, deep neural networks have made impressive advancements i...
We aim at advancing open-vocabulary object detection, which detects objects described by arbitrary t...
In this paper, we describe a simple approach to learning models of visual object categories from ima...
We describe a model of object recognition as machine translation. In this model, recognition is a pr...
With the growth of open sensor networks, multiple applications in different domains make use of a la...
Existing object detection methods are bounded in a fixed-set vocabulary by costly labeled data. When...
Open-vocabulary object detection, which is concerned with the problem of detecting novel objects gui...
The task of open-vocabulary object-centric image retrieval involves the retrieval of images containi...
This thesis presents a novel paradigm known as the 'Pictorial Dictionary' for the content-based imag...
We propose an unsupervised method that, given a word, automatically selects non-abstract senses of t...
This thesis focuses on efficient and effective object retrieval from an unlabelled collection of ima...
We propose an approach to object recognition using vocabulary tree which, instead of finding the clo...
We describe building a large-scale image ontology using the WordNet lexical resources. This ontology...
This paper presents a novel search paradigm that uses mul-tiple images as input to perform semantic ...
Current approaches to object category recognition require datasets of training images to be manuall...
In the field of visual scene understanding, deep neural networks have made impressive advancements i...
We aim at advancing open-vocabulary object detection, which detects objects described by arbitrary t...
In this paper, we describe a simple approach to learning models of visual object categories from ima...
We describe a model of object recognition as machine translation. In this model, recognition is a pr...
With the growth of open sensor networks, multiple applications in different domains make use of a la...