This work introduces double-mapping Gated Recurrent Units (dGRU), an extension of standard GRUs where the input is considered as a recurrent state. An extra set of logic gates is added to update the input given the output. Stacking multiple such layers results in a recurrent auto-encoder: the operators updating the outputs comprise the encoder, while the ones updating the inputs form the decoder. Since the states are shared between corresponding encoder and decoder layers, the representation is stratified during learning: some information is not passed to the next layers. We test our model on future video prediction. Main challenges for this task include high variability in videos, temporal propagation of errors, and non-specificity of futu...
Predicting future frames in videos has become a promising direction of research for both computer vi...
It is well believed that video captioning is a fundamental but challenging task in both computer vis...
In this paper, we introduce an approach for future frames prediction based on a single input image. ...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Modular neural networks have received an upsurge of attention lately owing to their unique modular d...
Despite the recent popularity of deep generative state space models, few comparisons have been made ...
Typical video classification methods often divide a video into short clips, do inference on each cli...
International audienceRecently, video prediction algorithms based on neural networks have become a p...
Typical video classification methods often divide a video into short clips, do inference on each cli...
The ability to predict future states of the environment is a central pillar of intelligence. At its ...
We propose modeling time series by representing the transformations that take a frame at time t to a...
While recent deep learning methods have made significant progress on the video prediction problem, m...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
© 2016 IEEE. Recently, deep learning approach, especially deep Convolutional Neural Networks (ConvNe...
Predicting future frames in videos has become a promising direction of research for both computer vi...
It is well believed that video captioning is a fundamental but challenging task in both computer vis...
In this paper, we introduce an approach for future frames prediction based on a single input image. ...
Deep neural networks are becoming central in several areas of computer vision. While there have been...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Modular neural networks have received an upsurge of attention lately owing to their unique modular d...
Despite the recent popularity of deep generative state space models, few comparisons have been made ...
Typical video classification methods often divide a video into short clips, do inference on each cli...
International audienceRecently, video prediction algorithms based on neural networks have become a p...
Typical video classification methods often divide a video into short clips, do inference on each cli...
The ability to predict future states of the environment is a central pillar of intelligence. At its ...
We propose modeling time series by representing the transformations that take a frame at time t to a...
While recent deep learning methods have made significant progress on the video prediction problem, m...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
© 2016 IEEE. Recently, deep learning approach, especially deep Convolutional Neural Networks (ConvNe...
Predicting future frames in videos has become a promising direction of research for both computer vi...
It is well believed that video captioning is a fundamental but challenging task in both computer vis...
In this paper, we introduce an approach for future frames prediction based on a single input image. ...