Language is such a powerful representation for capturing the knowledge and information about our world. It excels at expressing discrete concepts such as objects and their attributes, the relationships between them in a very compact manner all due to its extremely high level of abstraction. Language is the primary means by which we communicate, comprehend, and express our thoughts and ideas, and it lies at the very core of human intelligence. With the advent of powerful generative models, machines also have begun to comprehend and generate natural language with notable fluency and creativity. However, they lack “grounding”—a direct tie to the visual world. Vision plays a pivotal role in our comprehension and production of language. When we ...
Artificial Intelligence (AI) has transformed the way we interact with technology e.g., chatbots, voi...
Materials are the building blocks of our surroundings. Material perception enables us to create a vi...
Recently, there has been an increasing number of efforts to introduce models capable of generating n...
Powered by deep convolutional networks and large scale visual datasets, modern computer vision syste...
Powered by deep convolutional networks and large scale visual datasets, modern computer vision syste...
Grounding natural language onto real-world perception is a fundamental challenge to empower various ...
Grounding natural language onto real-world perception is a fundamental challenge to empower various ...
This electronic version was submitted by the student author. The certified thesis is available in th...
Each time we ask for an object, describe a scene, follow directions or read a document containi...
154 pagesOver the course of the last decades, we have witnessed the significant progress of machine ...
Generating images from textual descriptions has gained a lot of attention. Recently, DALL-E, a multi...
One ultimate goal of AI is to develop an artificial intelligent (AI) system that can communicate wit...
Large language models are known to suffer from the hallucination problem in that they are prone to o...
Paper accepted for presentation at the ViGIL 2021 workshop @NAACL. This version: added models to the...
Large language models are known to suffer from the hallucination problem in that they are prone to o...
Artificial Intelligence (AI) has transformed the way we interact with technology e.g., chatbots, voi...
Materials are the building blocks of our surroundings. Material perception enables us to create a vi...
Recently, there has been an increasing number of efforts to introduce models capable of generating n...
Powered by deep convolutional networks and large scale visual datasets, modern computer vision syste...
Powered by deep convolutional networks and large scale visual datasets, modern computer vision syste...
Grounding natural language onto real-world perception is a fundamental challenge to empower various ...
Grounding natural language onto real-world perception is a fundamental challenge to empower various ...
This electronic version was submitted by the student author. The certified thesis is available in th...
Each time we ask for an object, describe a scene, follow directions or read a document containi...
154 pagesOver the course of the last decades, we have witnessed the significant progress of machine ...
Generating images from textual descriptions has gained a lot of attention. Recently, DALL-E, a multi...
One ultimate goal of AI is to develop an artificial intelligent (AI) system that can communicate wit...
Large language models are known to suffer from the hallucination problem in that they are prone to o...
Paper accepted for presentation at the ViGIL 2021 workshop @NAACL. This version: added models to the...
Large language models are known to suffer from the hallucination problem in that they are prone to o...
Artificial Intelligence (AI) has transformed the way we interact with technology e.g., chatbots, voi...
Materials are the building blocks of our surroundings. Material perception enables us to create a vi...
Recently, there has been an increasing number of efforts to introduce models capable of generating n...