Existing techniques to encode spatial invariance within deep convolutional neural networks (CNNs) apply the same warping field to all the feature channels. This does not account for the fact that the individual feature channels can represent different semantic parts, which can undergo different spatial transformations w.r.t. a canonical configuration. To overcome this limitation, we introduce a learnable module, the volumetric transformer network (VTN), that predicts channel-wise warping fields so as to reconfigure intermediate CNN features spatially and channel-wisely. We design our VTN as an encoder-decoder network, with modules dedicated to letting the information flow across the feature channels, to account for the dependencies between ...
Image dehazing is challenging due to the problem of ill-posed parameter estimation. Numerous prior-b...
An important goal in visual recognition is to devise image representations that are invariant to par...
Inspired by two basic mechanisms in animal visual systems, we introduce a feature transform techniqu...
Spatial transformer networks (STNs) were designed to enable convolutional neural networks (CNNs) to ...
Convolutional Neural Networks (CNNs) are extremely efficient, since they exploit the inherent transl...
Introducing variation in the training dataset through data augmentation has been a popular technique...
We show that spatial transformations of CNN feature maps cannot align the feature maps of a transfor...
This thesis addresses the problem of investigating the properties and abilities of a variety of comp...
Spatial transformer networks (STNs) were designed to enable CNNs to learn invariance to image transf...
© 2016 ACM. Convolutional neural networks (CNNs) have achieved stateof-the-art results on many visua...
In this paper, we propose a fully convolutional network-based dense map from voxels to invertible pa...
Semantic segmentation solves the task of labelling every pixel inan image with its class label, and ...
While convolutional neural networks have shown a tremendous impact on various computer vision tasks,...
In recent years, convolutional networks have dramatically (re)emerged as the dominant paradigm for s...
Spatial Transformer Networks (STNs) have the potential to dramatically improve performance of convol...
Image dehazing is challenging due to the problem of ill-posed parameter estimation. Numerous prior-b...
An important goal in visual recognition is to devise image representations that are invariant to par...
Inspired by two basic mechanisms in animal visual systems, we introduce a feature transform techniqu...
Spatial transformer networks (STNs) were designed to enable convolutional neural networks (CNNs) to ...
Convolutional Neural Networks (CNNs) are extremely efficient, since they exploit the inherent transl...
Introducing variation in the training dataset through data augmentation has been a popular technique...
We show that spatial transformations of CNN feature maps cannot align the feature maps of a transfor...
This thesis addresses the problem of investigating the properties and abilities of a variety of comp...
Spatial transformer networks (STNs) were designed to enable CNNs to learn invariance to image transf...
© 2016 ACM. Convolutional neural networks (CNNs) have achieved stateof-the-art results on many visua...
In this paper, we propose a fully convolutional network-based dense map from voxels to invertible pa...
Semantic segmentation solves the task of labelling every pixel inan image with its class label, and ...
While convolutional neural networks have shown a tremendous impact on various computer vision tasks,...
In recent years, convolutional networks have dramatically (re)emerged as the dominant paradigm for s...
Spatial Transformer Networks (STNs) have the potential to dramatically improve performance of convol...
Image dehazing is challenging due to the problem of ill-posed parameter estimation. Numerous prior-b...
An important goal in visual recognition is to devise image representations that are invariant to par...
Inspired by two basic mechanisms in animal visual systems, we introduce a feature transform techniqu...