Audio Description (AD) provides linguistic descriptions of movies and allows visually impaired people to follow a movie along with their peers. Such descriptions are by design mainly visual and thus naturally form an interesting data source for computer vision and computational linguistics. In this work we propose a novel dataset which contains transcribed ADs, which are temporally aligned to full length movies. In addition we also collected and aligned movie scripts used in prior work and compare the two sources of descriptions. In total the Large Scale Movie Description Challenge (LSMDC) contains a parallel corpus of 118,114 sentences and video clips from 202 movies. First we characterize the dataset by benchmarking different approaches f...
The chapter opens with a definition of audio description (AD) \u2013 an accessible form of audiovisu...
International audienceIn this paper, we propose an audio-visual approach to video genre categorizati...
The internet hosts an immense reservoir of videos, witnessing a constant influx of thousands ofuploa...
Audio Description (AD) provides linguistic descriptions of movies and allows visually impaired peopl...
ABSTRACT: Audio description (AD) provides linguistic descriptions of movies and allows visually impa...
Descriptive video service (DVS) provides linguistic de-scriptions of movies and allows visually impa...
To make the content of moving images and audio-visual media available to a visually impaired audienc...
Generating natural language descriptions for visual data links computer vision and computational lin...
Generating natural language descriptions for visual data links computer vision and computational lin...
Abstract. Multimedia documents are increasingly used to disseminate specialized scientific knowledg...
This work deals with the representation of audiovisual information, to organize its content for futu...
This thesis explores the translation of mainstream film imagery in audio description (AD) for visual...
Humans can easily describe what they see in a coherent way and at varying level of detail. However, ...
| openaire: EC/H2020/780069/EU//MeMADThis chapter focuses on the recent surge of interest in automat...
Our objective in this work is long range understanding of the narrative structure of movies. Instead...
The chapter opens with a definition of audio description (AD) \u2013 an accessible form of audiovisu...
International audienceIn this paper, we propose an audio-visual approach to video genre categorizati...
The internet hosts an immense reservoir of videos, witnessing a constant influx of thousands ofuploa...
Audio Description (AD) provides linguistic descriptions of movies and allows visually impaired peopl...
ABSTRACT: Audio description (AD) provides linguistic descriptions of movies and allows visually impa...
Descriptive video service (DVS) provides linguistic de-scriptions of movies and allows visually impa...
To make the content of moving images and audio-visual media available to a visually impaired audienc...
Generating natural language descriptions for visual data links computer vision and computational lin...
Generating natural language descriptions for visual data links computer vision and computational lin...
Abstract. Multimedia documents are increasingly used to disseminate specialized scientific knowledg...
This work deals with the representation of audiovisual information, to organize its content for futu...
This thesis explores the translation of mainstream film imagery in audio description (AD) for visual...
Humans can easily describe what they see in a coherent way and at varying level of detail. However, ...
| openaire: EC/H2020/780069/EU//MeMADThis chapter focuses on the recent surge of interest in automat...
Our objective in this work is long range understanding of the narrative structure of movies. Instead...
The chapter opens with a definition of audio description (AD) \u2013 an accessible form of audiovisu...
International audienceIn this paper, we propose an audio-visual approach to video genre categorizati...
The internet hosts an immense reservoir of videos, witnessing a constant influx of thousands ofuploa...