This thesis explores how a computer can learn the structure of visual objects in the absence of strong supervision using self-supervised learning. We demonstrate that we can learn structural representations of objects using an autoencoding framework with reconstruction as the key learning signal. We do this by engineering bottlenecks that disentangle object structure from other factors of variation. Moreover, we design the bottlenecks to represent the object structure in the form of 2D and 3D object landmarks or 3D mesh. Specifically, we develop a method that automatically discovers 2D object landmarks without any annotations using a conditional autoencoder with 2D keypoint bottleneck that disentangles pose, represented as 2D keypoints, and...
We address the task of inferring the 3D structure underlying an image, in particular focusing on two...
We propose a new method for recognizing the pose of objects from a single image that for learning us...
Learning representations for point clouds is an important task in 3D computer vision, especially wit...
Structured representations such as keypoints are widely used in pose transfer, conditional image gen...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
This thesis explores how to harness neural networks to learn 3D structure from visual data. Being ab...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
Learning automatically the structure of object categories remains an important open problem in compu...
This thesis tackles the challenge of learning the abstract structure of object categories without ma...
This thesis tackles the challenge of learning the abstract structure of object categories without ma...
We address the task of inferring the 3D structure underlying an image, in particular focusing on two...
We address the task of inferring the 3D structure underlying an image, in particular focusing on two...
We propose a new method for recognizing the pose of objects from a single image that for learning us...
Learning representations for point clouds is an important task in 3D computer vision, especially wit...
Structured representations such as keypoints are widely used in pose transfer, conditional image gen...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
This thesis explores how to harness neural networks to learn 3D structure from visual data. Being ab...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
We propose a method to learn 3D deformable object categories from raw single-view images, without ex...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
International audienceIn this work we introduce Lifting Autoencoders, a generative 3D surface-based...
Learning automatically the structure of object categories remains an important open problem in compu...
This thesis tackles the challenge of learning the abstract structure of object categories without ma...
This thesis tackles the challenge of learning the abstract structure of object categories without ma...
We address the task of inferring the 3D structure underlying an image, in particular focusing on two...
We address the task of inferring the 3D structure underlying an image, in particular focusing on two...
We propose a new method for recognizing the pose of objects from a single image that for learning us...
Learning representations for point clouds is an important task in 3D computer vision, especially wit...