Predicting future frames in videos has become a promising direction of research for both computer vision and robot learning communities. The core of this problem involves moving object capture and future motion prediction. While object capture specifies which objects are moving in videos, motion prediction describes their future dynamics. Motivated by this analysis, we propose a Cubic Long Short-Term Memory (CubicLSTM) unit for video prediction. CubicLSTM consists of three branches, i.e., a spatial branch for capturing moving objects, a temporal branch for processing motions, and an output branch for combining the first two branches to generate predicted frames. Stacking multiple CubicLSTM units along the spatial branch and output branch, a...
Autonomous systems not only need to understand their current environment, but should also be able to...
Analyzing and understanding human actions in long-range videos has promising applications, such as v...
While great strides have been made in using deep learning algorithms to solve supervised learning ta...
International audienceRecently, video prediction algorithms based on neural networks have become a p...
The use of recurrent neural networks in several applications has allowed to capture impressive resul...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
Action prediction based on video is an important problem in computer vision field with many applicat...
Our work addresses long-term motion context issues for predicting future frames. To predict the futu...
While recent deep learning methods have made significant progress on the video prediction problem, m...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames....
Transformers have recently been popular for learning and inference in the spatial-temporal domain. H...
Modular neural networks have received an upsurge of attention lately owing to their unique modular d...
We use Long Short Term Memory (LSTM) networks to learn representations of video se-quences. Our mode...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
This work introduces double-mapping Gated Recurrent Units (dGRU), an extension of standard GRUs wher...
Autonomous systems not only need to understand their current environment, but should also be able to...
Analyzing and understanding human actions in long-range videos has promising applications, such as v...
While great strides have been made in using deep learning algorithms to solve supervised learning ta...
International audienceRecently, video prediction algorithms based on neural networks have become a p...
The use of recurrent neural networks in several applications has allowed to capture impressive resul...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
Action prediction based on video is an important problem in computer vision field with many applicat...
Our work addresses long-term motion context issues for predicting future frames. To predict the futu...
While recent deep learning methods have made significant progress on the video prediction problem, m...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames....
Transformers have recently been popular for learning and inference in the spatial-temporal domain. H...
Modular neural networks have received an upsurge of attention lately owing to their unique modular d...
We use Long Short Term Memory (LSTM) networks to learn representations of video se-quences. Our mode...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
This work introduces double-mapping Gated Recurrent Units (dGRU), an extension of standard GRUs wher...
Autonomous systems not only need to understand their current environment, but should also be able to...
Analyzing and understanding human actions in long-range videos has promising applications, such as v...
While great strides have been made in using deep learning algorithms to solve supervised learning ta...