Depth can provide useful geographical cues for salient object detection (SOD), and has been proven helpful in recent RGB-D SOD methods. However, existing video salient object detection (VSOD) methods only utilize spatiotemporal information and seldom exploit depth information for detection. In this paper, we propose a depth-cooperated trimodal network, called DCTNet for VSOD, which is a pioneering work to incorporate depth information to assist VSOD. To this end, we first generate depth from RGB frames, and then propose an approach to treat the three modalities unequally. Specifically, a multi-modal attention module (MAM) is designed to model multi-modal long-range dependencies between the main modality (RGB) and the two auxiliary modalitie...
International audienceEfficiently exploiting multi-modal inputs for accurate RGB-D saliency detectio...
RGB-D salient object detection is one of the basic tasks in computer vision. Most existing models fo...
Previous methods based on 3DCNN, convLSTM, or optical flow have achieved great success in video sali...
Abstract Recently proposed state-of-the-art saliency detection models rely heavily on labeled datase...
Benefiting from color independence, illumination invariance and location discrimination attributed b...
International audienceRecent RGBD-based models for saliency detection have attracted research attent...
Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D ...
Salient Object Detection is the task of predicting the human attended region in a given scene. Fusin...
RGB-D salient object detection (SOD) aims at locating the most eye-catching object in visual input b...
RGB-D saliency object detection (SOD) is an important pre-processing operation for various computer ...
Abstract The effective integration of RGB and depth map features to improve the performance of RGB‐D...
International audienceRGB-D saliency detection aims to fuse multi-modal cues to accurately localize ...
The existing saliency detection models based on RGB colors only leverage appearance cues to detect s...
Salient object detection (SOD) extracts meaningful contents from an input image. RGB-based SOD metho...
Most existing RGB-D salient detection models pay more attention to the quality of the depth images, ...
International audienceEfficiently exploiting multi-modal inputs for accurate RGB-D saliency detectio...
RGB-D salient object detection is one of the basic tasks in computer vision. Most existing models fo...
Previous methods based on 3DCNN, convLSTM, or optical flow have achieved great success in video sali...
Abstract Recently proposed state-of-the-art saliency detection models rely heavily on labeled datase...
Benefiting from color independence, illumination invariance and location discrimination attributed b...
International audienceRecent RGBD-based models for saliency detection have attracted research attent...
Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D ...
Salient Object Detection is the task of predicting the human attended region in a given scene. Fusin...
RGB-D salient object detection (SOD) aims at locating the most eye-catching object in visual input b...
RGB-D saliency object detection (SOD) is an important pre-processing operation for various computer ...
Abstract The effective integration of RGB and depth map features to improve the performance of RGB‐D...
International audienceRGB-D saliency detection aims to fuse multi-modal cues to accurately localize ...
The existing saliency detection models based on RGB colors only leverage appearance cues to detect s...
Salient object detection (SOD) extracts meaningful contents from an input image. RGB-based SOD metho...
Most existing RGB-D salient detection models pay more attention to the quality of the depth images, ...
International audienceEfficiently exploiting multi-modal inputs for accurate RGB-D saliency detectio...
RGB-D salient object detection is one of the basic tasks in computer vision. Most existing models fo...
Previous methods based on 3DCNN, convLSTM, or optical flow have achieved great success in video sali...