Many modern applications require extracting the core attributes of human behavior, such as a person's attention, intent, or skill level, from visual data. There are two main challenges related to this problem. First, we need models that can represent visual data in terms of object-level cues. Second, we need models that can infer the core behavioral attributes from the visual data. We refer to these two challenges as "learning to see" and "seeing to learn", respectively. In this PhD thesis, we have made progress towards addressing both challenges. We tackle the problem of "learning to see" by developing methods that extract object-level information directly from raw visual data. This includes two top-down conto...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
Humans move their eyes in order to learn visual representations of the world. These eye movements de...
Deep learning has significantly advanced computer vision in the past decade, paving the way for prac...
Machine learning is an important multidisciplinary field of research, which aims to construct models...
Understanding how to use a physical environment is a key requirement for embodied AI agents operatin...
The human visual system is capable of interpreting a remarkable variety of often subtle, learnt, cha...
Fundamental to many tasks in the field of computer vision, this work considers the understanding of ...
A core problem in visual object learning is using a finite number of images of a new object to accur...
For robots to act autonomously, it is crucial that they not only describe their envi...
Videos captured from wearable cameras, known as egocentric videos, create a continuous record of hum...
Understanding human motions and activities in images and videos is an important problem in many appl...
Thesis (Ph.D.)--University of Washington, 2020. With increasingly high interest in assistive robots an...