Incorporating geometric transformations that reflect the relative position changes between an observer and an object into computer vision and deep learning models has attracted much attention in recent years. However, the existing proposals mainly focus on affine transformations that cannot fully show viewpoint changes. Furthermore, current solutions often apply a neural network module to learn a single transformation matrix, which ignores the possibility for various viewpoints and creates extra to-be-trained module parameters. We propose a layer (PT layer) to learn the perspective transformations that not only model the geometries in affine transformation but also reflect the viewpoint changes. In addition, being able to be directly traine...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Learning to perceive scenes and objects from 2D images as 3D models is atrivial task for a human but...
Recently geometric deep learning introduced a new way for machine learning algorithms to tackle poin...
Incorporating geometric transformations that reflect the relative position changes between an observ...
Images taken under different camera poses are rotated or distorted, which leads to poor perception e...
Under the assumption of weak perspective, two views of the same planar object are related through an...
Local processing is an essential feature of CNNs and other neural network architectures - it is one ...
We consider the problem of learning the mapping between the image coordinates of unknown affine view...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
Humans are remarkably flexible in understanding viewpoint changes due to visual cortex supporting th...
The goal of machine vision is to develop human-like visual abilities; however, it is unclear whether...
To form view-invariant representations of objects, neurons in the inferior temporal cortex may assoc...
We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between classic ...
Multi-view projection techniques have shown themselves to be highly effective in achieving top-perfo...
In this work we introduce a new self-supervised, semi-parametric approach for synthesizing novel vie...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Learning to perceive scenes and objects from 2D images as 3D models is atrivial task for a human but...
Recently geometric deep learning introduced a new way for machine learning algorithms to tackle poin...
Incorporating geometric transformations that reflect the relative position changes between an observ...
Images taken under different camera poses are rotated or distorted, which leads to poor perception e...
Under the assumption of weak perspective, two views of the same planar object are related through an...
Local processing is an essential feature of CNNs and other neural network architectures - it is one ...
We consider the problem of learning the mapping between the image coordinates of unknown affine view...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
Humans are remarkably flexible in understanding viewpoint changes due to visual cortex supporting th...
The goal of machine vision is to develop human-like visual abilities; however, it is unclear whether...
To form view-invariant representations of objects, neurons in the inferior temporal cortex may assoc...
We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between classic ...
Multi-view projection techniques have shown themselves to be highly effective in achieving top-perfo...
In this work we introduce a new self-supervised, semi-parametric approach for synthesizing novel vie...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Learning to perceive scenes and objects from 2D images as 3D models is atrivial task for a human but...
Recently geometric deep learning introduced a new way for machine learning algorithms to tackle poin...