University of Technology Sydney. Faculty of Engineering and Information Technology. Video understanding is a complex task in computer vision: it requires not only recognizing objects, people, and scenes, but also capturing and remembering how visual content changes over time. Rapid progress in recent years on building blocks such as image classification provides great opportunities for accurate and efficient video understanding. Various deep learning applications for video understanding, based on deep convolutional neural networks and recurrent neural networks, have been studied. In this thesis, I present my research on large-scale video analysis and understanding in three major aspects: video representation learning, rec...
The process of identifying a specific event from a video is a relatively easy task for humans. Howev...
In this thesis, we investigate different representations and models for large-scale video understand...
The aim of this PhD thesis is to make a step forward towards teaching computers to understand videos...
Recently, the broad adoption of the internet coupled with connected smart devices has seen a signifi...
The field of computer vision has long strived to extract understanding from images and videos sequen...
Video understanding is one of the fundamental problems in computer vision. Videos provide more infor...
For most people, watching a brief video and describing what happened (in words) is an easy task. For...
Video analysis and understanding have attracted increasing attention in recent years. Th...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...
Vision-to-language problems, such as video annotation or visual question answering, stand out from ...
With the exponential growth of digital data, video content analysis (e.g., action, event recogni...
In recent times, digital media contents are inherently of multimedia type, consisting of the form te...
University of Technology Sydney. Faculty of Engineering and Information Technology. Multi-modal perce...
A long-standing goal of artificial intelligence is to enable machines to perceive the visual world a...
Understanding visual media, i.e. images and videos, has been a cornerstone topic in computer vision ...