There has been continuous growth in the volume and ubiquity of video material. It has become essential to define video semantics in order to aid the searchability and retrieval of this data. We present a framework that produces textual descriptions of video, based on the visual semantic content. Detected action classes rendered as verbs, participant objects converted to noun phrases, visual properties of detected objects rendered as adjectives and spatial relations between objects rendered as prepositions. Further, in cases of zero-shot action recognition, a language model is used to infer a missing verb, aided by the detection of objects and scene settings. These extracted features are converted into textual descriptions using a template-b...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
We present a holistic data-driven technique that generates natural-language descriptions for videos....
Linking natural language to visual data is an important topic at the intersection of Natural Languag...
This contribution addresses generation of natural language descriptions for important visual content...
This thesis is concerned with the automatic generation of natural language descriptions that can be ...
Generating natural language descriptions for visual data links computer vision and computational lin...
This paper integrates techniques in natural language processing and computer vision to improve recog...
This paper integrates techniques in natural language processing and computer vision to improve recog...
Generating natural language descriptions for visual data links computer vision and computational lin...
This paper integrates techniques in natural language processing and computer vision to improve recog...
This exploratory work is concerned with generation of natural language descriptions that can be used...
Humans use rich natural language to describe and com-municate visual perceptions. In order to provid...
Humans use rich natural language to describe and com-municate visual perceptions. In order to provid...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
We present a holistic data-driven technique that generates natural-language descriptions for videos....
Linking natural language to visual data is an important topic at the intersection of Natural Languag...
This contribution addresses generation of natural language descriptions for important visual content...
This thesis is concerned with the automatic generation of natural language descriptions that can be ...
Generating natural language descriptions for visual data links computer vision and computational lin...
This paper integrates techniques in natural language processing and computer vision to improve recog...
This paper integrates techniques in natural language processing and computer vision to improve recog...
Generating natural language descriptions for visual data links computer vision and computational lin...
This paper integrates techniques in natural language processing and computer vision to improve recog...
This exploratory work is concerned with generation of natural language descriptions that can be used...
Humans use rich natural language to describe and com-municate visual perceptions. In order to provid...
Humans use rich natural language to describe and com-municate visual perceptions. In order to provid...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
Humans use rich natural language to describe and communicate visual perceptions. In order to provide...
We present a holistic data-driven technique that generates natural-language descriptions for videos....
Linking natural language to visual data is an important topic at the intersection of Natural Languag...