Empowering machines to understand compositionality is considered by many (Lake et al., 2017; Lake and Baroni, 2018; Schölkopf et al., 2021) a promising path towards improved representational interpretability and out-of-distribution generalization. Yet, discovering the compositional structures of raw sensory data requires solving a factorization problem, i.e. decomposing the unstructured observations into modular components. Handling the factorization problem presents numerous technical challenges, especially in unsupervised settings which we explore to avoid the heavy burden of human annotation. In this thesis, we approach the factorization problem from a generative perspective. Specifically, we develop unsupervised machine learning models ...
We present a novel causal generative model for unsupervised object-centric 3D scene understanding th...
Natural images arise from complicated processes involving many factors of variation. They reflect t...
Institute for Adaptive and Neural ComputationDeveloping computer vision algorithms able to learn fr...
Visual scenes are extremely rich in diversity, not only because there are infinite combinations of o...
Learning how to model complex scenes in a modular way with recombinable components is a pre-requisit...
Object-centric learning has gained significant attention over the last years as it can serve as a po...
The appearance of the same object may vary in different scene images due to perspectives and occlusi...
Visual scene representation learning is an important research problem in the field of computer visio...
Generative latent-variable models are emerging as promising tools in robotics and reinforcement lear...
What constitutes a scene? Defining a meaningful vocabulary for scene discovery is a challenging prob...
Humans develop a common-sense understanding of the physical behaviour of the world, within the firs...
3D generative models of objects enable photorealistic image synthesis with 3Dcontrol. Existing metho...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Systematic compositionality, or the ability to adapt to novel situations by creating a mental model ...
We present a framework called Acquired Deep Impressions (ADI) which continuously learns knowledge of...
We present a novel causal generative model for unsupervised object-centric 3D scene understanding th...
Natural images arise from complicated processes involving many factors of variation. They reflect t...
Institute for Adaptive and Neural ComputationDeveloping computer vision algorithms able to learn fr...
Visual scenes are extremely rich in diversity, not only because there are infinite combinations of o...
Learning how to model complex scenes in a modular way with recombinable components is a pre-requisit...
Object-centric learning has gained significant attention over the last years as it can serve as a po...
The appearance of the same object may vary in different scene images due to perspectives and occlusi...
Visual scene representation learning is an important research problem in the field of computer visio...
Generative latent-variable models are emerging as promising tools in robotics and reinforcement lear...
What constitutes a scene? Defining a meaningful vocabulary for scene discovery is a challenging prob...
Humans develop a common-sense understanding of the physical behaviour of the world, within the firs...
3D generative models of objects enable photorealistic image synthesis with 3Dcontrol. Existing metho...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Systematic compositionality, or the ability to adapt to novel situations by creating a mental model ...
We present a framework called Acquired Deep Impressions (ADI) which continuously learns knowledge of...
We present a novel causal generative model for unsupervised object-centric 3D scene understanding th...
Natural images arise from complicated processes involving many factors of variation. They reflect t...
Institute for Adaptive and Neural ComputationDeveloping computer vision algorithms able to learn fr...