In this paper, we propose a general framework for image classification using the attention mechanism and global context, which could incorporate with various network architectures to improve their performance. To investigate the capability of the global context, we compare four mathematical models and observe the global context encoded in the category disentangled conditional generative model could give more guidance as "know what is task irrelevant will also know what is relevant". Based on this observation, we define a novel Category Disentangled Global Context (CDGC) and devise a deep network to obtain it. By attending CDGC, the baseline networks could identify the objects of interest more accurately, thus improving the performance. We a...
Paper accepted to ICANN 2021 - The 30th International Conference on Artificial Neural NetworksIntern...
In the recent advancements attention mechanism in deep learning had played a vital role in proving b...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...
This work introduces an attention mechanism for image classifiers and the corresponding deep neural ...
Over the past few years, a significant progress has been made in deep convolutional neural networks ...
Deeply learned representations have achieved superior image retrieval performance in a retrieve-then...
We propose global context vision transformer (GC ViT), a novel architecture that enhances parameter ...
Which object detector is suitable for your context sensitive task? Deep object detectors exploit sce...
Since the recent success of Vision Transformers (ViTs), explorations toward transformer-style archit...
Predicting salient regions in natural images requires the detection of objects that are present in a...
The vanilla self-attention mechanism inherently relies on pre-defined and steadfast computational di...
Convolution Neural Networks (CNN) and Recurrent Neural Networks (RNN), which are the main research m...
Recently, attention mechanisms have been explored with ConvNets, both across the spatial and channel...
Deep neural networks have reached human-level performance on many computer vision tasks. However, th...
Attention guidance is an approach to addressing dataset bias in deep learning, where the model relie...
Paper accepted to ICANN 2021 - The 30th International Conference on Artificial Neural NetworksIntern...
In the recent advancements attention mechanism in deep learning had played a vital role in proving b...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...
This work introduces an attention mechanism for image classifiers and the corresponding deep neural ...
Over the past few years, a significant progress has been made in deep convolutional neural networks ...
Deeply learned representations have achieved superior image retrieval performance in a retrieve-then...
We propose global context vision transformer (GC ViT), a novel architecture that enhances parameter ...
Which object detector is suitable for your context sensitive task? Deep object detectors exploit sce...
Since the recent success of Vision Transformers (ViTs), explorations toward transformer-style archit...
Predicting salient regions in natural images requires the detection of objects that are present in a...
The vanilla self-attention mechanism inherently relies on pre-defined and steadfast computational di...
Convolution Neural Networks (CNN) and Recurrent Neural Networks (RNN), which are the main research m...
Recently, attention mechanisms have been explored with ConvNets, both across the spatial and channel...
Deep neural networks have reached human-level performance on many computer vision tasks. However, th...
Attention guidance is an approach to addressing dataset bias in deep learning, where the model relie...
Paper accepted to ICANN 2021 - The 30th International Conference on Artificial Neural NetworksIntern...
In the recent advancements attention mechanism in deep learning had played a vital role in proving b...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...