Incorporating geometric transformations that reflect the relative position changes between an observer and an object into computer vision and deep learning models has attracted much attention in recent years. However, the existing proposals mainly focus on the affine transformation that is insufficient to reflect such geometric position changes. Furthermore, current solutions often apply a neural network module to learn a single transformation matrix, which not only ignores the importance of multi-view analysis but also includes extra training parameters from the module apart from the transformation matrix parameters that increase the model complexity. In this paper, a perspective transformation layer is proposed in the context of deep lear...
To form view-invariant representations of objects, neurons in the inferior temporal cortex may assoc...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Humans are remarkably flexible in understanding viewpoint changes due to visual cortex supporting th...
Incorporating geometric transformations that reflect the relative position changes between an observ...
The objective of this paper is to rectify any monocular image by computing a homography matrix that ...
Local processing is an essential feature of CNNs and other neural network architectures - it is one ...
Images taken under different camera poses are rotated or distorted, which leads to poor perception e...
Homography is an important area of computer vision for scene understanding and plays a key role in e...
Planar homography estimation refers to the problem of computing a bijective linear mapping of pixels...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
The goal of machine vision is to develop human-like visual abilities; however, it is unclear whether...
Gaze and head pose estimation can play essential roles in various applications, such as human attent...
Under the assumption of weak perspective, two views of the same planar object are related through an...
A collection of images of a scene captured from different perspectives inform us about the scene's c...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
To form view-invariant representations of objects, neurons in the inferior temporal cortex may assoc...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Humans are remarkably flexible in understanding viewpoint changes due to visual cortex supporting th...
Incorporating geometric transformations that reflect the relative position changes between an observ...
The objective of this paper is to rectify any monocular image by computing a homography matrix that ...
Local processing is an essential feature of CNNs and other neural network architectures - it is one ...
Images taken under different camera poses are rotated or distorted, which leads to poor perception e...
Homography is an important area of computer vision for scene understanding and plays a key role in e...
Planar homography estimation refers to the problem of computing a bijective linear mapping of pixels...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
The goal of machine vision is to develop human-like visual abilities; however, it is unclear whether...
Gaze and head pose estimation can play essential roles in various applications, such as human attent...
Under the assumption of weak perspective, two views of the same planar object are related through an...
A collection of images of a scene captured from different perspectives inform us about the scene's c...
In order to perform object recognition, it is necessary to form perceptual representations that are ...
To form view-invariant representations of objects, neurons in the inferior temporal cortex may assoc...
In this work, we present a network architecture with parallel convolutional neural networks (CNN) fo...
Humans are remarkably flexible in understanding viewpoint changes due to visual cortex supporting th...