We propose a computational model of visually-grounded spatial language under-standing, based on a study of how people verbally describe objects in visual scenes. We describe our implementation of word level visually-grounded semantics and their embedding in a compositional parsing frame-work. The implemented system selects the correct referents in response to a broad range of referring expressions for a large percentage of test cases. In an analysis of the system’s successes and failures we reveal how visual context influences the semantics of utterances and propose future extensions to the model that take such context into account.
This study explores the impact of visual context on the conceptual salience of a discourse entity, u...
© 2016 John Benjamins Publishing Company. This is the accepted manuscript of a chapter published in ...
Humans naturally use referring expressions with verbal utterances and nonverbal gestures to refer to...
We present a visually-grounded language understanding model based on a study of how people verbally ...
Grounding language in the physical world enables humans to use words and sentences in context and to...
AbstractThe fundamental claim of this paper is that salience—both visual and linguistic—is an import...
To what extent is the choice of what to say driven by seemingly irrelevant cues in the visual world ...
Artificial Intelligence (AI) technologies affect many facets of our daily lives. AI systems help us ...
International audienceThe way we see the objects around us determines speech and gestures we use to ...
Burigo M, Knoeferle P. Visual attention during spatial language comprehension: Is a referential link...
We present a visually grounded model of speech perception which projects spoken utterances and image...
Spatial language descriptions, such as The bottle is over the glass, direct the attention of the hea...
We use words to communicate about things and kinds of things, their properties, relations and action...
A central purpose of referring expressions is to distinguish intended referents from other entities ...
We explore contextual adaptation of referring expressions with respect to referential ambiguity and ...
This study explores the impact of visual context on the conceptual salience of a discourse entity, u...
© 2016 John Benjamins Publishing Company. This is the accepted manuscript of a chapter published in ...
Humans naturally use referring expressions with verbal utterances and nonverbal gestures to refer to...
We present a visually-grounded language understanding model based on a study of how people verbally ...
Grounding language in the physical world enables humans to use words and sentences in context and to...
AbstractThe fundamental claim of this paper is that salience—both visual and linguistic—is an import...
To what extent is the choice of what to say driven by seemingly irrelevant cues in the visual world ...
Artificial Intelligence (AI) technologies affect many facets of our daily lives. AI systems help us ...
International audienceThe way we see the objects around us determines speech and gestures we use to ...
Burigo M, Knoeferle P. Visual attention during spatial language comprehension: Is a referential link...
We present a visually grounded model of speech perception which projects spoken utterances and image...
Spatial language descriptions, such as The bottle is over the glass, direct the attention of the hea...
We use words to communicate about things and kinds of things, their properties, relations and action...
A central purpose of referring expressions is to distinguish intended referents from other entities ...
We explore contextual adaptation of referring expressions with respect to referential ambiguity and ...
This study explores the impact of visual context on the conceptual salience of a discourse entity, u...
© 2016 John Benjamins Publishing Company. This is the accepted manuscript of a chapter published in ...
Humans naturally use referring expressions with verbal utterances and nonverbal gestures to refer to...