The goal of this work is to understand the way actions are performed in videos. That is, given a video, we aim to predict an adverb indicating a modification applied to the action (e.g. cut “finely”). We cast this problem as a regression task. We measure textual relationships between verbs and adverbs to generate a regression target representing the action change we aim to learn. We test our approach on a range of datasets and achieve state-of-the-art results on both adverb prediction and antonym classification. Furthermore, we outperform previous work when we lift two commonly assumed conditions: the availability of action labels during testing and the pairing of adverbs as antonyms. Existing datasets for adverb recognition are either nois...
We propose a layered-grammar model to represent actions. Using this model, an action is represented ...
Video action recognition has been in the center of the stage since its introduction in 2004 [SLC04]....
A huge amount of videos have been created, spread, and viewed daily. Among these massive videos, the...
Video understanding is a research hotspot of computer vision and significant progress has been made ...
We aim to understand how actions are performed and identify subtle differences, such as 'fold firmly...
Recent work has shown that the integration of visual information into text-based models can substant...
This research presents a novel method for learning the lexical semantics of action verbs. The primar...
International audienceThis paper exploits the context of natural dynamic scenes for human action rec...
This reviews, motivates, and extends the event analysis of action sentences and shows how it explain...
International audienceLinguistic ambiguities arising from changes in entities in action flows are a ...
Video is becoming more and more popular as a learning medium in a variety of educational settings, ...
Can we teach a robot to recognize and make predictions for activities that it has never seen before?...
Abstract. Recognizing activities in real-world videos is a chal-lenging AI problem. We present a nov...
In this article, we deal with the problem of predicting action progress in videos. We argue that thi...
This paper presents a novel approach for analyzing human actions in non-scripted, unconstrained vide...
We propose a layered-grammar model to represent actions. Using this model, an action is represented ...
Video action recognition has been in the center of the stage since its introduction in 2004 [SLC04]....
A huge amount of videos have been created, spread, and viewed daily. Among these massive videos, the...
Video understanding is a research hotspot of computer vision and significant progress has been made ...
We aim to understand how actions are performed and identify subtle differences, such as 'fold firmly...
Recent work has shown that the integration of visual information into text-based models can substant...
This research presents a novel method for learning the lexical semantics of action verbs. The primar...
International audienceThis paper exploits the context of natural dynamic scenes for human action rec...
This reviews, motivates, and extends the event analysis of action sentences and shows how it explain...
International audienceLinguistic ambiguities arising from changes in entities in action flows are a ...
Video is becoming more and more popular as a learning medium in a variety of educational settings, ...
Can we teach a robot to recognize and make predictions for activities that it has never seen before?...
Abstract. Recognizing activities in real-world videos is a chal-lenging AI problem. We present a nov...
In this article, we deal with the problem of predicting action progress in videos. We argue that thi...
This paper presents a novel approach for analyzing human actions in non-scripted, unconstrained vide...
We propose a layered-grammar model to represent actions. Using this model, an action is represented ...
Video action recognition has been in the center of the stage since its introduction in 2004 [SLC04]....
A huge amount of videos have been created, spread, and viewed daily. Among these massive videos, the...