To make effective use of multimedia content, the crucial point is structural analysis of the content, which should integrate several media processing techniques, including image, audio, and text analysis. To understand utterances in videos in accordance with the scene, it is essential to recognize which objects appear in the videos. In this paper, we focus on Japanese cooking TV videos and propose a method for acquiring object models of foods in an unsupervised manner and performing object recognition based on the acquired object models. First, the topic of each video segment is identified using HMMs to obtain good examples for object model acquisition. After that, close-up images are extracted from ...
In this paper we present a novel technique for object modeling and object recognition in video. Give...
With the number of videos growing rapidly in modern society, automatically recognizing objects from ...
This paper integrates techniques in natural language processing and computer vision to improve recog...
To perform real-world information processing, such as intelligent robotics, multimodal dialogue syste...
In this paper we introduce two novel methods for object recognition from video. Our major contributi...
In realizing a video retrieval system, the crucial point is how to provide an effective acce...
In conjunction with the 27th International Joint Conference on Artificial Intelligence International ...
We propose a new approach to the object recognition problem motivated by the availability of large annot...
The analysis of video for semantics presents significant advantages for many content-based applicati...
We present a novel method for aligning a sequence of instructions to a video of someone carrying o...
We propose a new approach to recognize objects and scenes in news videos motivated by the availabili...
In this dissertation, we discuss our work on analyzing cooking content for the ultimate goal of autom...
This paper presents an unsupervised topic identification method integrating linguistic and visual i...
In this paper, we propose a method to abstract cooking videos. We define cooking video a...