This Letter presents an attention‐modulating network for video object segmentation that can well adapt its segmentation model to the annotated frame. Specifically, the authors first develop an efficient visual and spatial attention modulator to fast modulate the segmentation model to focus on the specific object of interest. Then they design a channel and spatial attention module and inject it into the segmentation model to further refine its feature maps. In addition, to fuse multi‐scale context information, they construct a feature pyramid attention module to further process the top layer feature maps, achieving better pixel‐level attention for the high‐level feature maps. Finally, to address the sample imbalance issue in training, they e...
International audienceCurrent state-of-the art approaches to action recognition emphasize learning C...
There has been historic progress in the field of image understanding over the past few years. Deep l...
While originally designed for natural language processing tasks, the self-attention mechanism has re...
Semi-supervised video object segmentation is a fundamental yet Challenging task in computer vision. ...
Abstract To improve the segmentation performance on videos with large object motion or deformation, ...
Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in ...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Temporal segmentation of events is an essential task and a precursor for the automatic recognition o...
Video object segmentation is the task of estimating foreground object segments from the background t...
International audienceVideo Object Segmentation (VOS) is crucial for several applications, from vide...
Abstract It is a big challenge for unsupervised video segmentation without any object annotation or ...
Object detection and segmentation are important computer vision problems that have applications in s...
This paper presents a novel approach for segmenting moving objects in unconstrained environments usi...
Video object segmentation (VOS) is a highly challenging problem since the initial mask, defining the...
Video-based computer vision tasks can benefit from estimation of the salient regions and interaction...
International audienceCurrent state-of-the art approaches to action recognition emphasize learning C...
There has been historic progress in the field of image understanding over the past few years. Deep l...
While originally designed for natural language processing tasks, the self-attention mechanism has re...
Semi-supervised video object segmentation is a fundamental yet Challenging task in computer vision. ...
Abstract To improve the segmentation performance on videos with large object motion or deformation, ...
Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in ...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Temporal segmentation of events is an essential task and a precursor for the automatic recognition o...
Video object segmentation is the task of estimating foreground object segments from the background t...
International audienceVideo Object Segmentation (VOS) is crucial for several applications, from vide...
Abstract It is a big challenge for unsupervised video segmentation without any object annotation or ...
Object detection and segmentation are important computer vision problems that have applications in s...
This paper presents a novel approach for segmenting moving objects in unconstrained environments usi...
Video object segmentation (VOS) is a highly challenging problem since the initial mask, defining the...
Video-based computer vision tasks can benefit from estimation of the salient regions and interaction...
International audienceCurrent state-of-the art approaches to action recognition emphasize learning C...
There has been historic progress in the field of image understanding over the past few years. Deep l...
While originally designed for natural language processing tasks, the self-attention mechanism has re...