Representations are crucial for a robot to learn effective navigation policies. Recent work has shown that mid-level perceptual abstractions, such as depth estimates or 2D semantic segmentation, lead to more effective policies when provided as observations in place of raw sensor data (e.g., RGB images). However, such policies must still learn latent three-dimensional scene properties from mid-level abstractions. In contrast, high-level, hierarchical representations such as 3D scene graphs explicitly provide a scene's geometry, topology, and semantics, making them compelling representations for navigation. In this work, we present a reinforcement learning framework that leverages high-level hierarchical representations to learn navigation po...
Robotic systems are nowadays capable of solving complex navigation tasks. However, their capabilitie...
Scene representation is the process of converting sensory observations of an environment into compac...
The ability to navigate like a human towards a language-guided target from anywhere in a 3D embodied...
It is extremely difficult to teach robots the skills that humans take for granted. Understanding the...
A fundamental task in robotics is to navigate between two locations. In particular, real-world navig...
3D scene graphs have emerged as a powerful map representation for robotics. Scene graphs encode spat...
How can a robot navigate successfully in rich and diverse environments, indoors or outdoors, along o...
Artificially intelligent agents with some degree of autonomy in the real world must learn to complet...
3D scene graphs have recently emerged as a powerful high-level representation of 3D environments. A ...
Navigating complex indoor environments requires a deep understanding of the space the robotic agent ...
Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frame...
A major current challenge in reinforcement learning re-search is to extend methods that work well on...
In this work visual navigation task in realistic simulated environment is formulated and solved usin...
Endowing robots with human-like physical reasoning abilities remains challenging. We argue that exis...
We train embodied neural networks to plan and navigate unseen complex 3D environments, emphasising r...
Robotic systems are nowadays capable of solving complex navigation tasks. However, their capabilitie...
Scene representation is the process of converting sensory observations of an environment into compac...
The ability to navigate like a human towards a language-guided target from anywhere in a 3D embodied...
It is extremely difficult to teach robots the skills that humans take for granted. Understanding the...
A fundamental task in robotics is to navigate between two locations. In particular, real-world navig...
3D scene graphs have emerged as a powerful map representation for robotics. Scene graphs encode spat...
How can a robot navigate successfully in rich and diverse environments, indoors or outdoors, along o...
Artificially intelligent agents with some degree of autonomy in the real world must learn to complet...
3D scene graphs have recently emerged as a powerful high-level representation of 3D environments. A ...
Navigating complex indoor environments requires a deep understanding of the space the robotic agent ...
Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frame...
A major current challenge in reinforcement learning re-search is to extend methods that work well on...
In this work visual navigation task in realistic simulated environment is formulated and solved usin...
Endowing robots with human-like physical reasoning abilities remains challenging. We argue that exis...
We train embodied neural networks to plan and navigate unseen complex 3D environments, emphasising r...
Robotic systems are nowadays capable of solving complex navigation tasks. However, their capabilitie...
Scene representation is the process of converting sensory observations of an environment into compac...
The ability to navigate like a human towards a language-guided target from anywhere in a 3D embodied...