Learning how to model complex scenes in a modular way with recombinable components is a pre-requisite for higher-order reasoning and acting in the physical world. However, current generative models lack the ability to capture the inherently compositional and layered nature of visual scenes. While recent work has made progress towards unsupervised learning of object-based scene representations, most models still maintain a global representation space (i.e., objects are not explicitly separated), and cannot generate scenes with novel object arrangement and depth ordering. Here, we present an alternative approach which uses an inductive bias encouraging modularity by training an ensemble of generative models (experts). During training, experts...
Objects in scenes interact with each other in complex ways. A key observation is that these interact...
Object-centric learning has gained significant attention over the last years as it can serve as a po...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Generative latent-variable models are emerging as promising tools in robotics and reinforcement lear...
Visual scenes are extremely rich in diversity, not only because there are infinite combinations of o...
Empowering machines to understand compositionality is considered by many (Lake et al., 2017; Lake an...
Current computational approaches to learning visual object categories require thousands of training ...
Deep learning has been widely used in real-life applications during the last few decades, such as fa...
We study unsupervised learning of occluding objects in images of visual scenes. The derived learning...
We present a novel causal generative model for unsupervised object-centric 3D scene understanding th...
We present a generative model of images that explicitly reasons over the set of objects they show. O...
Visual object recognition is one of the key human capabilities that we would like machines to have. ...
Abstract – An important component of higher level fusion and decision making is knowledge discovery....
A scene category imposes tight distributions over the kind of objects that might appear in the scene...
A scene category imposes tight distributions over the kind of objects that might appear in the scene...
Objects in scenes interact with each other in complex ways. A key observation is that these interact...
Object-centric learning has gained significant attention over the last years as it can serve as a po...
One of the key factors driving the success of machine learning for scene understanding is the develo...
Generative latent-variable models are emerging as promising tools in robotics and reinforcement lear...
Visual scenes are extremely rich in diversity, not only because there are infinite combinations of o...
Empowering machines to understand compositionality is considered by many (Lake et al., 2017; Lake an...
Current computational approaches to learning visual object categories require thousands of training ...
Deep learning has been widely used in real-life applications during the last few decades, such as fa...
We study unsupervised learning of occluding objects in images of visual scenes. The derived learning...
We present a novel causal generative model for unsupervised object-centric 3D scene understanding th...
We present a generative model of images that explicitly reasons over the set of objects they show. O...
Visual object recognition is one of the key human capabilities that we would like machines to have. ...
Abstract – An important component of higher level fusion and decision making is knowledge discovery....
A scene category imposes tight distributions over the kind of objects that might appear in the scene...
A scene category imposes tight distributions over the kind of objects that might appear in the scene...
Objects in scenes interact with each other in complex ways. A key observation is that these interact...
Object-centric learning has gained significant attention over the last years as it can serve as a po...
One of the key factors driving the success of machine learning for scene understanding is the develo...