While recent machine learning research has revealed connections between deep generative models such as VAEs and rate-distortion losses used in learned compression, most of this work has focused on images. In a similar spirit, we view recently proposed neural video coding algorithms through the lens of deep autoregressive and latent variable modeling. We present recent neural video codecs as instances of a generalized stochastic temporal autoregressive transform, and propose new avenues for further improvements inspired by normalizing flows and structured priors. We propose several architectures that yield state-of-the-art video compression performance on full-resolution video and discuss their tradeoffs and ablations. In particular, we prop...
Deep generative models are a class of techniques that train deep neural networks to model the distri...
Video conferencing systems suffer from poor user experience when network conditions deteriorate beca...
Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but the...
End-to-end image/video codecs are getting competitive compared to traditional compression techniques...
End-to-end deep trainable models are about to exceed the performance of the traditional handcrafted ...
Neural compression is the application of neural networks and other machine learning methods to data ...
We present the first neural video compression method based on generative adversarial networks (GANs)...
Humans do not perceive all parts of a scene with the same resolution, but rather focus on few region...
With the development of deep learning techniques, the combination of deep learning with image compre...
Traditional image and video compression algorithms rely on hand-crafted encoder/decoder pairs (codec...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In video compression, coding efficiency is improved by reusing pixels from previously decoded frames...
Neural-based image and video codecs are significantly more power-efficient when weights and activati...
We introduce a novel generative model for video prediction based on latent flow matching, an efficie...
Neural networks have dramatically increased our capacity to learn from large, high-dimensional datas...
Deep generative models are a class of techniques that train deep neural networks to model the distri...
Video conferencing systems suffer from poor user experience when network conditions deteriorate beca...
Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but the...
End-to-end image/video codecs are getting competitive compared to traditional compression techniques...
End-to-end deep trainable models are about to exceed the performance of the traditional handcrafted ...
Neural compression is the application of neural networks and other machine learning methods to data ...
We present the first neural video compression method based on generative adversarial networks (GANs)...
Humans do not perceive all parts of a scene with the same resolution, but rather focus on few region...
With the development of deep learning techniques, the combination of deep learning with image compre...
Traditional image and video compression algorithms rely on hand-crafted encoder/decoder pairs (codec...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In video compression, coding efficiency is improved by reusing pixels from previously decoded frames...
Neural-based image and video codecs are significantly more power-efficient when weights and activati...
We introduce a novel generative model for video prediction based on latent flow matching, an efficie...
Neural networks have dramatically increased our capacity to learn from large, high-dimensional datas...
Deep generative models are a class of techniques that train deep neural networks to model the distri...
Video conferencing systems suffer from poor user experience when network conditions deteriorate beca...
Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but the...