PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspired architecture based on the propagation of prediction error with self-supervised representation learning in video. While the architecture has drawn a lot of attention and various extensions of the model exist, there is a lack of a critical analysis. We fill in the gap by evaluating PredNet both as an implementation of the predictive coding theory and as a self-supervised video prediction model using a challenging video action classification dataset. We design an extended model to test if conditioning future frame predictions on the action class of the video improves the model performance. We show that PredNet does not yet completely follow t...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames....
While recent deep learning methods have made significant progress on the video prediction problem, m...
PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspir...
While great strides have been made in using deep learning algorithms to solve supervised learning ta...
International audienceIn this paper, we propose a self-supervised method for video representation le...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
The objective of this paper is self-supervised learning from video, in particular for representation...
The ability to predict, anticipate and reason about future outcomes is a key component of intelligen...
International audienceContrastive Predictive Coding (CPC) (van den Oord et al., 2018) has been succe...
International audienceDeep neural networks excel at image classification, but their performance is f...
Predictive coding is an unsupervised learning principle which has been proposed to explain the brain...
Self-supervised learning methods overcome the key bottleneck for building more capable AI: limited a...
International audienceTo effectively manage and utilize the massive amount of visual data generated ...
The cerebral cortex predicts visual motion to adapt human behavior to surrounding objects moving in ...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames....
While recent deep learning methods have made significant progress on the video prediction problem, m...
PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspir...
While great strides have been made in using deep learning algorithms to solve supervised learning ta...
International audienceIn this paper, we propose a self-supervised method for video representation le...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
The objective of this paper is self-supervised learning from video, in particular for representation...
The ability to predict, anticipate and reason about future outcomes is a key component of intelligen...
International audienceContrastive Predictive Coding (CPC) (van den Oord et al., 2018) has been succe...
International audienceDeep neural networks excel at image classification, but their performance is f...
Predictive coding is an unsupervised learning principle which has been proposed to explain the brain...
Self-supervised learning methods overcome the key bottleneck for building more capable AI: limited a...
International audienceTo effectively manage and utilize the massive amount of visual data generated ...
The cerebral cortex predicts visual motion to adapt human behavior to surrounding objects moving in ...
(a) Inference on an example input image sequence of 10 frames. Top to bottom: Input sequence; model’...
Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames....
While recent deep learning methods have made significant progress on the video prediction problem, m...