Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student-submitted PDF version of thesis.Includes bibliographical references (pages 171-192).Multimodal documents occur in a variety of forms, as graphs in technical reports, diagrams in textbooks, and graphic designs in bulletins. Humans can efficiently process the visual and textual information contained within to make decisions on topics including business, healthcare, and science. Building the computational tools to understand multimodal documents can have import...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Gaps and requirements for multi-modal interfaces for humanities can be explored by observing the con...
Right now you are reading a sentence. Earlier, you might have been looking at a realistic picture, s...
From the gestures that accompany speech to images in social media posts, humans effortlessly combine...
New advances in animation, scientific visualization, and graphical user interfaces make it essential...
In this paper, we propose a computational architecture for multimodal comprehension of text and grap...
Presented online on October 28, 2020 at 12:15 p.m.Adriana Kovashka is an Assistant Professor in Comp...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Visuals play a vital role in human communication in the modern media landscape, but there have been ...
My dissertation focuses on developing computational models of eye movements for understanding how co...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Visual search is an important part of human-computer interaction (HCI). The visual search processes ...
This paper presents a data model for images immersed in the world wide web and that derive their mea...
Abstract. This article deals with the visualization of textual information, and provides a descripti...
This thesis explores multimodal document classification algorithms in a unified framework. Classific...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Gaps and requirements for multi-modal interfaces for humanities can be explored by observing the con...
Right now you are reading a sentence. Earlier, you might have been looking at a realistic picture, s...
From the gestures that accompany speech to images in social media posts, humans effortlessly combine...
New advances in animation, scientific visualization, and graphical user interfaces make it essential...
In this paper, we propose a computational architecture for multimodal comprehension of text and grap...
Presented online on October 28, 2020 at 12:15 p.m.Adriana Kovashka is an Assistant Professor in Comp...
Recent years have seen an explosion in multimodal data on the web. It is therefore important to perf...
Visuals play a vital role in human communication in the modern media landscape, but there have been ...
My dissertation focuses on developing computational models of eye movements for understanding how co...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Visual search is an important part of human-computer interaction (HCI). The visual search processes ...
This paper presents a data model for images immersed in the world wide web and that derive their mea...
Abstract. This article deals with the visualization of textual information, and provides a descripti...
This thesis explores multimodal document classification algorithms in a unified framework. Classific...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Gaps and requirements for multi-modal interfaces for humanities can be explored by observing the con...
Right now you are reading a sentence. Earlier, you might have been looking at a realistic picture, s...