We present ConCur, a contrastive video representation learning method that uses curriculum learning to impose a dynamic sampling strategy in contrastive training. More specifically, ConCur starts the contrastive training with easy positive samples (temporally close and semantically similar clips), and as the training progresses, it increases the temporal span effectively sampling hard positives (temporally away and semantically dissimilar). To learn better context-aware representations, we also propose an auxiliary task of predicting the temporal distance between a positive pair of clips. We conduct extensive experiments on two popular action recognition datasets, UCF101 and HMDB51, on which our proposed method achieves state-of-the-art per...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
Learning time-series representations when only unlabeled data or few labeled samples are available c...
A steady momentum of innovations and breakthroughs has convincingly pushed the limits of unsupervise...
International audienceIn this paper, we propose a self-supervised method for video representation le...
Video representation learning has been successful in video-text pre-training for zero-shot transfer,...
Self-supervised video representation learning aimed at maximizing similarity between different tempo...
Contrastive learning has shown promising potential in self-supervised spatio-temporal representation...
We propose a self-supervised learning approach for videos that learns representations of both the RG...
This thesis presents a novel self-supervised approach of learning visual representations from videos...
Shenzhen Science and Technology Projects (Grant Number: JCYJ20200109143035495 and JCYJ20180306173210...
In low-level video analyses, effective representations are important to derive the correspondences b...
The objective of this paper is visual-only self-supervised video representation learning. We make th...
We propose a novel self-supervised method, referred to as Video Cloze Procedure (VCP), to learn rich...
With the rapid advancement of deep learning techniques in computer vision, researchers have achieved...
The tremendous growth in video data, both on the internet and in real life, has encouraged the devel...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
Learning time-series representations when only unlabeled data or few labeled samples are available c...
A steady momentum of innovations and breakthroughs has convincingly pushed the limits of unsupervise...
International audienceIn this paper, we propose a self-supervised method for video representation le...
Video representation learning has been successful in video-text pre-training for zero-shot transfer,...
Self-supervised video representation learning aimed at maximizing similarity between different tempo...
Contrastive learning has shown promising potential in self-supervised spatio-temporal representation...
We propose a self-supervised learning approach for videos that learns representations of both the RG...
This thesis presents a novel self-supervised approach of learning visual representations from videos...
Shenzhen Science and Technology Projects (Grant Number: JCYJ20200109143035495 and JCYJ20180306173210...
In low-level video analyses, effective representations are important to derive the correspondences b...
The objective of this paper is visual-only self-supervised video representation learning. We make th...
We propose a novel self-supervised method, referred to as Video Cloze Procedure (VCP), to learn rich...
With the rapid advancement of deep learning techniques in computer vision, researchers have achieved...
The tremendous growth in video data, both on the internet and in real life, has encouraged the devel...
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, su...
Learning time-series representations when only unlabeled data or few labeled samples are available c...
A steady momentum of innovations and breakthroughs has convincingly pushed the limits of unsupervise...