Descriptive video service (DVS) provides linguistic descriptions of movies and allows visually impaired people to follow a movie along with their peers. Such descriptions are by design mainly visual and thus naturally form an interesting data source for computer vision and computational linguistics. In this work we propose a novel dataset of transcribed DVS, temporally aligned to full-length HD movies. In addition, we collected the aligned movie scripts that have been used in prior work and compare the two sources of descriptions. In total, the Movie Description dataset contains a parallel corpus of over 54,000 sentences and video snippets from 72 HD movies. We characterize the dataset by benchmarking ...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Humans are entertained and emotionally captivated by a good story. Artworks, such as operas, theatre...
Audio Description (AD) provides linguistic descriptions of movies and allows visually impaired peopl...
ABSTRACT: Audio description (AD) provides linguistic descriptions of movies and allows visually impa...
Description of the dataset, "This dataset comprises scripts collected from various sources, in...
Generating natural language descriptions for visual data links computer vision and computational lin...
Intelligent multimedia information retrieval deals with massive unstructured data, especially digita...
Humans can easily describe what they see in a coherent way and at varying levels of detail. However, ...
Humans use rich natural language to describe and communicate visual perceptions. In order to provid...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Nowadays, due to the vast number of camera-equipped devices, a large amount of data in terms of image and v...