textRecognizing activities in real-world videos is a difficult problem exacerbated by background clutter, changes in camera angle & zoom, occlusion and rapid camera movements. Large corpora of labeled videos can be used to train automated activity recognition systems, but this requires expensive human labor and time. This thesis explores how closed captions that naturally accompany many videos can act as weak supervision that allows automatically collecting 'labeled' data for activity recognition. We show that such an approach can improve activity retrieval in soccer videos. Our system requires no manual labeling of video clips and needs minimal human supervision. We also present a novel caption classifier that uses additional linguistic in...
Currently all video search engines are text-based, i.e. they search for the text labels associated w...
We address the problem of text-based activity retrieval in video. Given a sentence describing an act...
Understanding human activities in unconstrained natural videos is a widely studied problem, yet it r...
textRecognizing activities in real-world videos is a difficult problem exacerbated by background clu...
Recognizing activities in real-world videos is a difficult problem exacerbated by background clutter...
Recognizing activities in real-world videos is a difficult problem exacerbated by background clutter...
textCollecting and annotating videos of realistic human actions is tedious, yet critical for trainin...
Recent e orts in computer vision tackle the problem of human activity understanding in video sequenc...
Abstract. Recognizing activities in real-world videos is a chal-lenging AI problem. We present a nov...
Recently, the broad adoption of the internet coupled with connected smart devices has seen a signifi...
This paper presents a video OCR system that automatically extracts closed captions from video frames...
Page web de l'article : http://lear.inrialpes.fr/pubs/2009/GMS09/International audienceThis paper pr...
Enabling machines to solve computer vision tasks with natural language components can greatly improv...
Activity retrieval is a growing field in electrical engineering that specializes in the search and r...
We generate massive amounts of video data every day. While most real-world videos are long and untri...
Currently all video search engines are text-based, i.e. they search for the text labels associated w...
We address the problem of text-based activity retrieval in video. Given a sentence describing an act...
Understanding human activities in unconstrained natural videos is a widely studied problem, yet it r...
textRecognizing activities in real-world videos is a difficult problem exacerbated by background clu...
Recognizing activities in real-world videos is a difficult problem exacerbated by background clutter...
Recognizing activities in real-world videos is a difficult problem exacerbated by background clutter...
textCollecting and annotating videos of realistic human actions is tedious, yet critical for trainin...
Recent e orts in computer vision tackle the problem of human activity understanding in video sequenc...
Abstract. Recognizing activities in real-world videos is a chal-lenging AI problem. We present a nov...
Recently, the broad adoption of the internet coupled with connected smart devices has seen a signifi...
This paper presents a video OCR system that automatically extracts closed captions from video frames...
Page web de l'article : http://lear.inrialpes.fr/pubs/2009/GMS09/International audienceThis paper pr...
Enabling machines to solve computer vision tasks with natural language components can greatly improv...
Activity retrieval is a growing field in electrical engineering that specializes in the search and r...
We generate massive amounts of video data every day. While most real-world videos are long and untri...
Currently all video search engines are text-based, i.e. they search for the text labels associated w...
We address the problem of text-based activity retrieval in video. Given a sentence describing an act...
Understanding human activities in unconstrained natural videos is a widely studied problem, yet it r...