With the massive increase of video content on the Internet and beyond, the automatic understanding of visual content could impact many different application fields such as robotics, health care, content search or filtering. The goal of this thesis is to provide methodological contributions in Computer Vision and Machine Learning for automatic content understanding from videos. We emphasize two problems, namely fine-grained human action recognition and visual reasoning from object-level interactions. In the first part of this manuscript, we tackle the problem of fine-grained human action recognition. We introduce two different trained attention mechanisms on the visual content from articulated human pose. The first method is able to automatically ...
This thesis addresses the problem of human action recognition in realistic video data, such as movie...
This thesis focuses on the problem of understanding visual contents, which can be images, videos or ...
This thesis studies the problem of human action recognition in videos. Recogn...
With the rapid growth of digital video content, automatic video understanding has become an increasin...
Automatic video understanding is expected to impact our lives through many applications such as auto...
This thesis targets recognition of human actions in videos. This problem can be defined as the abili...
In this thesis, we study human action recognition. Typically, different action...
In this thesis, we focus on modeling interactions between people, objects and scenes and show benefi...
The main objective of the thesis is to propose a complete framework for the automatic activity disco...