Video content is watched not only by humans, but increasingly also by machines. For example, machine learning models analyze surveillance video for security and traffic monitoring, search through YouTube videos for inappropriate content, and so on. In this paper, we propose a scalable video coding framework that supports machine vision (specifically, object detection) through its base layer bitstream and human vision via its enhancement layer bitstream. The proposed framework includes components from both conventional and Deep Neural Network (DNN)-based video coding. The results show that on object detection, the proposed framework achieves 13-19% bit savings compared to state-of-the-art video codecs, while remaining competitive in terms of...
Machine-To-Machine (M2M) communication applications and use cases, such as object detection and inst...
This paper introduces our recent research work on the development of a scalable foveated visual info...
The development of multimedia propagations and applications has led to a greater expansion in the f...
Almost all digital videos are coded into compact representations before being transmitted. Such comp...
The proliferation of deep learning-based machine vision applications has given rise to a new type of...
This paper provides a methodology to encode video objects in a scalable manner with regard to both c...
Traditional media coding schemes typically encode image/video into a semantic-unknown binary stream,...
Over recent years, deep learning-based computer vision systems have been applied to images at an eve...
Video coding algorithms encode and decode an entire video frame while feature coding techniques only...
Video and image coding for machines (VCM) is an emerging field that aims to develop compression meth...
Today, many image coding scenarios do not have a human as final intended user, but rather a machine ...
Throughout our life, we humans perceive the visual world, connect what we see over time, and make se...
In the Video Coding for Machines (VCM) context where visual content is compres...
We investigate video classification via a 3D deep convolutional neural network (CNN) that directly ...
Vision systems have become ubiquitous. They are used for traffic monitoring, elderly care, video con...