One ultimate goal of AI is to develop an artificial intelligent (AI) system that can communicate with people in a natural way. Such communication includes but is not limited to asking we humans questions, answering our questions, conducting dialogue with human beings, and performing some actions to better serve people. Imagine in the future where the service robot is everywhere, and we could ask our home robot to “grab me the red cup on the table.” To perform this command, the AI system needs to understand this spoken English sentence, perceive the visual world, navigate to the right place “table”, recognize the right object “the red cup”, then grab it and finally return it back to the commander. Just for this single command, it already inv...
Language is such a powerful representation for capturing the knowledge and information about our wor...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
The world around us involves multiple modalities -- we see objects, feel texture, hear sounds, smell...
Artificial Intelligence (AI) has transformed the way we interact with technology e.g., chatbots, voi...
Humans have amazing visual perception which allows them to comprehend what the eyes see. In the core...
Artificial Intelligence (AI) technologies affect many facets of our daily lives. AI systems help us ...
In this thesis I describe an operational implementation of an object detection and description syste...
In this dissertation, I propose and study a multi-modal Artificial Intelligence (AI) task called Vis...
Since most worldly phenomena can be expressed via language, language is a crucial medium for transfe...
In this dissertation, I propose and study a multi-modal Artificial Intelligence (AI) task called Vis...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Replicating a human-level understanding of the physical world in computers is a monumental task. Ach...
Understanding how to model computer vision and natural language jointly is a long-standing challenge...
Understanding how to model computer vision and natural language jointly is a long-standing challenge...
Language is such a powerful representation for capturing the knowledge and information about our wor...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
The world around us involves multiple modalities -- we see objects, feel texture, hear sounds, smell...
Artificial Intelligence (AI) has transformed the way we interact with technology e.g., chatbots, voi...
Humans have amazing visual perception which allows them to comprehend what the eyes see. In the core...
Artificial Intelligence (AI) technologies affect many facets of our daily lives. AI systems help us ...
In this thesis I describe an operational implementation of an object detection and description syste...
In this dissertation, I propose and study a multi-modal Artificial Intelligence (AI) task called Vis...
Since most worldly phenomena can be expressed via language, language is a crucial medium for transfe...
In this dissertation, I propose and study a multi-modal Artificial Intelligence (AI) task called Vis...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...
Replicating a human-level understanding of the physical world in computers is a monumental task. Ach...
Understanding how to model computer vision and natural language jointly is a long-standing challenge...
Understanding how to model computer vision and natural language jointly is a long-standing challenge...
Language is such a powerful representation for capturing the knowledge and information about our wor...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons ...