RGB image classification has achieved significant performance improvement with the resurge of deep convolutional neural networks. However, mono-modal deep models for RGB image still have several limitations when applied to RGB-D scene recognition. 1) Images for scene classification usually contain more than one typical object with flexible spatial distribution, so the object-level local features should also be considered in addition to global scene representation. 2) Multi-modal features in RGB-D scene classification are still under-utilized. Simply combining these modal-specific features suffers from the semantic gaps between different modalities. 3) Most existing methods neglect the complex relationships among multiple modality features. ...
Classification of indoor environments is a challenging problem. The availability of low-cost depth s...
This paper addresses the object recognition problem using multiple-domain inputs. We present a novel...
In this letter, a deep-learning-based pipeline is proposed to distinguish photographics (PGs) from c...
Scene recognition with RGB images has been extensively studied and has reached very remarkable recog...
While deep convolutional neural networks have shown a remarkable success in image classification, th...
Scene recognition is one of the basic problems in computer vision research with extensive applicatio...
This paper presents a novel multi-modal CNN architecture that exploits complementary input cues in a...
Recognizing semantic category of objects and scenes captured using vision-based sensors is a challen...
RGB and depth modalities contain more abundant and interactive information, and convolutional neural...
© 1979-2012 IEEE. People can recognize scenes across many different modalities beyond natural images...
RGB-D data has turned out to be a very useful representation for solving fundamental computer visio...
A novel deep neural network training paradigm that exploits the conjoint information in multiple het...
This paper focuses on the task of RGB-D indoor scene classification. It is a very challenging task d...
While convolutional neural networks (CNNs) have been excellent for object recognition, the greater s...
Deep learning based object recognition methods have achieved unprecedented success in the recent yea...
Classification of indoor environments is a challenging problem. The availability of low-cost depth s...
This paper addresses the object recognition problem using multiple-domain inputs. We present a novel...
In this letter, a deep-learning-based pipeline is proposed to distinguish photographics (PGs) from c...
Scene recognition with RGB images has been extensively studied and has reached very remarkable recog...
While deep convolutional neural networks have shown a remarkable success in image classification, th...
Scene recognition is one of the basic problems in computer vision research with extensive applicatio...
This paper presents a novel multi-modal CNN architecture that exploits complementary input cues in a...
Recognizing semantic category of objects and scenes captured using vision-based sensors is a challen...
RGB and depth modalities contain more abundant and interactive information, and convolutional neural...
© 1979-2012 IEEE. People can recognize scenes across many different modalities beyond natural images...
RGB-D data has turned out to be a very useful representation for solving fundamental computer visio...
A novel deep neural network training paradigm that exploits the conjoint information in multiple het...
This paper focuses on the task of RGB-D indoor scene classification. It is a very challenging task d...
While convolutional neural networks (CNNs) have been excellent for object recognition, the greater s...
Deep learning based object recognition methods have achieved unprecedented success in the recent yea...
Classification of indoor environments is a challenging problem. The availability of low-cost depth s...
This paper addresses the object recognition problem using multiple-domain inputs. We present a novel...
In this letter, a deep-learning-based pipeline is proposed to distinguish photographics (PGs) from c...