Indoor scene recognition is a growing field with great potential for behaviour understanding, robot localization, and elderly monitoring, among others. In this study, we approach the task of scene recognition from a novel standpoint, using multi-modal learning and video data gathered from social media. The accessibility and variety of social media videos can provide realistic data for modern scene recognition techniques and applications. We propose a model based on fusion of transcribed speech to text and visual features, which is used for classification on a novel dataset of social media videos of indoor scenes named InstaIndoor. Our model achieves up to 70% accuracy and 0.7 F1-Score. Furthermore, we highlight the potential of our approach...
With the rapid development of indoor localization in recent years; signals of opportunity have becom...
Deep learning (DL) models have emerged in recent years as the state-of-the-art technique across nume...
This thesis is concerned with multimodal machine learning for digital humanities. Multimodal machine...
Indoor scene recognition is a growing field with great potential for behaviour understanding, robot ...
The use of deep learning techniques has exploded during the last few years, resulting in a direct co...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...
In this paper, we propose a multimodal deep learning architecture for emotion recognition in video r...
Indoor scene recognition and semantic information can be useful for social robots. Recently, in the ...
Many smart home applications rely on indoor human activity recognition. This challenge is currently ...
Indoor scene recognition and semantic information can be helpful for social robots. Recently, in the...
University of Technology Sydney. Faculty of Engineering and Information Technology.Multi-modal perce...
There has been a tremendous increase in internet users and enough bandwidth in recent years. Because...
Currently, in the field of computer vision, video behaviour recognition has become a hot content and...
Indoor location-based services constitute an important part of our daily lives, providing position a...
© 2017 IEEE. Video-based human action recognition has become one of the most popular research areas ...
With the rapid development of indoor localization in recent years; signals of opportunity have becom...
Deep learning (DL) models have emerged in recent years as the state-of-the-art technique across nume...
This thesis is concerned with multimodal machine learning for digital humanities. Multimodal machine...
Indoor scene recognition is a growing field with great potential for behaviour understanding, robot ...
The use of deep learning techniques has exploded during the last few years, resulting in a direct co...
Deep learning has resulted in ground-breaking progress in a variety of domains, from core machine le...
In this paper, we propose a multimodal deep learning architecture for emotion recognition in video r...
Indoor scene recognition and semantic information can be useful for social robots. Recently, in the ...
Many smart home applications rely on indoor human activity recognition. This challenge is currently ...
Indoor scene recognition and semantic information can be helpful for social robots. Recently, in the...
University of Technology Sydney. Faculty of Engineering and Information Technology.Multi-modal perce...
There has been a tremendous increase in internet users and enough bandwidth in recent years. Because...
Currently, in the field of computer vision, video behaviour recognition has become a hot content and...
Indoor location-based services constitute an important part of our daily lives, providing position a...
© 2017 IEEE. Video-based human action recognition has become one of the most popular research areas ...
With the rapid development of indoor localization in recent years; signals of opportunity have becom...
Deep learning (DL) models have emerged in recent years as the state-of-the-art technique across nume...
This thesis is concerned with multimodal machine learning for digital humanities. Multimodal machine...