This dissertation provides a generic solution to model dynamic systems whose hidden state and the transition model are unknown in practice. We build the task-sufficient filtering framework to maintain a finite, abstract, and learnable representation (memory) that is sufficient to update itself, casually and iteratively, and to predict downstream task variables of interest. We show our realization of the framework by recurrent neural networks as universally-approximating function classes to imitate the functionality of a state transition model and a task prediction model.In addition, we provide practical methodologies to impose generic priors of the physical scene on the hidden representation. We leverage (lower-level) topological and regula...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
This dissertation provides a generic solution to model dynamic systems whose hidden state and the tr...
In this dissertation, I present my work towards exploring temporal information for better video unde...
AbstractWe present a trainable sequential-inference technique for processes with large state and obs...
Graduation date: 2017Access restricted to the OSU Community, at author's request, from December 13, ...
We address the challenge of learning good video representations by explicitly modeling the relations...
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-r...
The appearance of dynamic scenes is often largely governed by a latent low-dimensional dynamic proce...
One of the major research topics in computer vision is automatic video scene understanding where the...
Human actions captured in video sequences contain two crucial factors for action recognition, i.e., ...
This paper studies the dynamic generator model for spatialtemporal processes such as dynamic texture...
Transformers have recently been popular for learning and inference in the spatial-temporal domain. H...
International audienceWe study the problem of segmenting moving objects in unconstrained videos. Giv...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
This dissertation provides a generic solution to model dynamic systems whose hidden state and the tr...
In this dissertation, I present my work towards exploring temporal information for better video unde...
AbstractWe present a trainable sequential-inference technique for processes with large state and obs...
Graduation date: 2017Access restricted to the OSU Community, at author's request, from December 13, ...
We address the challenge of learning good video representations by explicitly modeling the relations...
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-r...
The appearance of dynamic scenes is often largely governed by a latent low-dimensional dynamic proce...
One of the major research topics in computer vision is automatic video scene understanding where the...
Human actions captured in video sequences contain two crucial factors for action recognition, i.e., ...
This paper studies the dynamic generator model for spatialtemporal processes such as dynamic texture...
Transformers have recently been popular for learning and inference in the spatial-temporal domain. H...
International audienceWe study the problem of segmenting moving objects in unconstrained videos. Giv...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Visual understanding of human behavior in video sequences is one of the fundamental topics in comput...
Deep neural networks are becoming central in several areas of computer vision. While there have been...