Spatial localization in time is vital for humans. Therefore we desire that computer vision algorithms are also able to spatially and temporally localize objects and actions. These algorithms generally learn from given data and discover patterns, parts, motions, and their locations by exploiting inductive biases that are essential for learning. However, localization is complex, error-prone and hard to inspect. In this thesis, we investigate location biases and how CNNs explore and exploit location and temporal information in the image and video domain. An interesting finding of the thesis is that heuristics about what is outside the image (border handling) enables CNNs to exploit absolute spatial location and break translation equivariance. ...
Visual place recognition is the task of automatically recognizing a previously visited location thro...
This dissertation addresses the problem of learning video representations, which is defined here as ...
2018-08-02Object localization is a crucial step for computers to understand an image. An object loca...
Locality is taken as the surrounding or nearby region of something in the world. Without the notion ...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports re...
Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports re...
International audienceWe propose an effective approach for action localization, both in the spatial ...
We introduce an approach for spatio-temporal human action localization using sparse spatial supervis...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
For several emerging technologies such as augmented reality, autonomous driving and robotics, visual...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
This thesis explores several practical applications of computer vision using learning based techniqu...
Free to read on publisher's website Convolutional Neural Networks (CNNs) have recently been shown to...
Visual place recognition is the task of automatically recognizing a previously visited location thro...
This dissertation addresses the problem of learning video representations, which is defined here as ...
2018-08-02Object localization is a crucial step for computers to understand an image. An object loca...
Locality is taken as the surrounding or nearby region of something in the world. Without the notion ...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports re...
Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports re...
International audienceWe propose an effective approach for action localization, both in the spatial ...
We introduce an approach for spatio-temporal human action localization using sparse spatial supervis...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
For several emerging technologies such as augmented reality, autonomous driving and robotics, visual...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
This thesis explores several practical applications of computer vision using learning based techniqu...
Free to read on publisher's website Convolutional Neural Networks (CNNs) have recently been shown to...
Visual place recognition is the task of automatically recognizing a previously visited location thro...
This dissertation addresses the problem of learning video representations, which is defined here as ...
2018-08-02Object localization is a crucial step for computers to understand an image. An object loca...