Video understanding is a booming research problem in computer vision. With its innate feature where spatial and temporal information entangles with each other, video understanding has been challenging mainly because of the difficulty for having a unified framework where these two aspects can be modeled jointly. Among the tasks in video understanding, human activity understanding and prediction serve as a good starting point where the spatial-temporal reasoning capability of learning modules can be tested. Most of the current approaches towards solving the human activity understanding and prediction problems use deep neural networks for spatial-temporal reasoning. However, this type of approach lacks the ability to reason beyond the local fr...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Abstract. In this paper, we investigate the problem of forecasting fu-ture activities in continuous ...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
Video understanding is a booming research problem in computer vision. With its innate feature where ...
Video-based recognition and prediction of a temporally extended activity can benefit from a detailed...
Human behavior is a continuous stochastic spatio-temporal process which is governed by semantic acti...
In this work1, we present a method to represent a video with a sequence of words, and learn the temp...
Graduation date: 2014This dissertation addresses the problem of recognizing human activities in vide...
We propose a probabilistic method for parsing a tempo-ral sequence such as a complex activity define...
In this paper, we present a novel approach of human activity prediction. Human activity prediction i...
This thesis contributes to the literature of understanding and recognizing human activities in video...
Abstract Stochastic Context-Free Grammars (SCFG) have been shown to be useful for vision-based human...
Abstract. Stochastic grammar has been used in many video analysis and event recognition applications...
This thesis presents a complete computational framework for discovering human actions and modeling h...
We propose a new graphical model, called Sequential Interval Network (SIN), for parsing complex stru...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Abstract. In this paper, we investigate the problem of forecasting fu-ture activities in continuous ...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
Video understanding is a booming research problem in computer vision. With its innate feature where ...
Video-based recognition and prediction of a temporally extended activity can benefit from a detailed...
Human behavior is a continuous stochastic spatio-temporal process which is governed by semantic acti...
In this work1, we present a method to represent a video with a sequence of words, and learn the temp...
Graduation date: 2014This dissertation addresses the problem of recognizing human activities in vide...
We propose a probabilistic method for parsing a tempo-ral sequence such as a complex activity define...
In this paper, we present a novel approach of human activity prediction. Human activity prediction i...
This thesis contributes to the literature of understanding and recognizing human activities in video...
Abstract Stochastic Context-Free Grammars (SCFG) have been shown to be useful for vision-based human...
Abstract. Stochastic grammar has been used in many video analysis and event recognition applications...
This thesis presents a complete computational framework for discovering human actions and modeling h...
We propose a new graphical model, called Sequential Interval Network (SIN), for parsing complex stru...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Abstract. In this paper, we investigate the problem of forecasting fu-ture activities in continuous ...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...