One of the most common vision problems is Video based Action Recognition. Many public datasets, public contests, and so on, boosted the development of new methods to face the challenges posed by this problem. Deep Learning is by far the most used technique to address Video-based Action Recognition problem. The common issue for these methods is the well-known dependency from training data. Methods are effective when training and test data are extracted from the same distribution. However, in real situations, this is not always the case. When test data has a different distribution than training one, methods result in considerable drop in performances. A solution to this issue is the so-called Domain Adaptation technique, whose goal is to cons...
In this report, we present the technical details of our submission to the 2022 EPIC-Kitchens Unsuper...
The number of application areas of deep neural networks for image classification is continuously gro...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Video-based Unsupervised Domain Adaptation (VUDA) methods improve the robustness of video models, en...
We apply domain adaptation to the problem of recognizing common actions between differing court-game...
Over the last few years, Unsupervised Domain Adaptation (UDA) techniques have acquired remarkable im...
Unsupervised domain adaptation (UDA) methods have become very popular in computer vision. However, w...
Domain adaptation (DA) approaches address domain shift and enable networks to be applied to differen...
In visual recognition problems, the common data distribution mismatches between training and testing...
Recent advances in big data systems and databases have made it possible to gather raw unlabeled data...
Video violence detection is a subset of human action recognition aiming to detect violent behaviors ...
The advent of deep convolutional networks has powered a new wave of progress in visual recognition. ...
Apparent motion information of an action may vary dramatically from one view to another, making tran...
In pattern recognition and computer vision, one is often faced with scenarios where the training dat...
2015-07-23In many applications (computer vision, natural language processing, speech recognition, et...
In this report, we present the technical details of our submission to the 2022 EPIC-Kitchens Unsuper...
The number of application areas of deep neural networks for image classification is continuously gro...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Video-based Unsupervised Domain Adaptation (VUDA) methods improve the robustness of video models, en...
We apply domain adaptation to the problem of recognizing common actions between differing court-game...
Over the last few years, Unsupervised Domain Adaptation (UDA) techniques have acquired remarkable im...
Unsupervised domain adaptation (UDA) methods have become very popular in computer vision. However, w...
Domain adaptation (DA) approaches address domain shift and enable networks to be applied to differen...
In visual recognition problems, the common data distribution mismatches between training and testing...
Recent advances in big data systems and databases have made it possible to gather raw unlabeled data...
Video violence detection is a subset of human action recognition aiming to detect violent behaviors ...
The advent of deep convolutional networks has powered a new wave of progress in visual recognition. ...
Apparent motion information of an action may vary dramatically from one view to another, making tran...
In pattern recognition and computer vision, one is often faced with scenarios where the training dat...
2015-07-23In many applications (computer vision, natural language processing, speech recognition, et...
In this report, we present the technical details of our submission to the 2022 EPIC-Kitchens Unsuper...
The number of application areas of deep neural networks for image classification is continuously gro...
This thesis focuses on video understanding for human action and interaction recognition. We start by...