Consider a domestic robot being asked to pick up "the cup nearest to the plate". Natural language is an intuitive way for humans to interact with robots. However, enabling robots to comprehend natural language, and correctly interpret spatial references, is challenging for two reasons. Firstly, phrases must be semantically represented in structures that can be processed computationally; secondly correspondences must be found to map these structures to models that represent objects, relationships and actions in the environment. Recently neural networks have demonstrated a strong potential to address both challenges, most notably in the context of Visual Question Answering (VQA) where they have performed well at answering natural language que...
Referring to objects in a natural and unambiguous manner is crucial for effective human-robot intera...
In verbal human-robot interaction natural language utterances have to be grounded in visual scenes b...
This paper describes a framework that enables robots to effi-ciently learn human-centric models of t...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In order for a robot to collaborate with a human to achieve a goal, the robot should be able to comm...
We present a cognitively plausible novel framework capable of learning the grounding in visual seman...
The current state of the art in military and first responder ground robots involves heavy physical a...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
A robot's ability to understand or ground natural language instructions is fundamentally tied to its...
What semantic structures can enable a system to understand and use spatial language in realistic sit...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Vision and Language Navigation (VLN) problem demands a robot to navigate accurately by combining the...
This paper describes a neural network model for the study of spatial language. It deals with both ge...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Referring to objects in a natural and unambiguous manner is crucial for effective human-robot intera...
In verbal human-robot interaction natural language utterances have to be grounded in visual scenes b...
This paper describes a framework that enables robots to effi-ciently learn human-centric models of t...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In order for a robot to collaborate with a human to achieve a goal, the robot should be able to comm...
We present a cognitively plausible novel framework capable of learning the grounding in visual seman...
The current state of the art in military and first responder ground robots involves heavy physical a...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
A robot's ability to understand or ground natural language instructions is fundamentally tied to its...
What semantic structures can enable a system to understand and use spatial language in realistic sit...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Vision and Language Navigation (VLN) problem demands a robot to navigate accurately by combining the...
This paper describes a neural network model for the study of spatial language. It deals with both ge...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Referring to objects in a natural and unambiguous manner is crucial for effective human-robot intera...
In verbal human-robot interaction natural language utterances have to be grounded in visual scenes b...
This paper describes a framework that enables robots to effi-ciently learn human-centric models of t...