When analyzing human motion videos, the output jitters from existing pose estimators are highly-unbalanced with varied estimation errors across frames. Most frames in a video are relatively easy to estimate and only suffer from slight jitters. In contrast, for rarely seen or occluded actions, the estimated positions of multiple joints largely deviate from the ground truth values for a consecutive sequence of frames, rendering significant jitters on them. To tackle this problem, we propose to attach a dedicated temporal-only refinement network to existing pose estimators for jitter mitigation, named SmoothNet. Unlike existing learning-based solutions that employ spatio-temporal models to co-optimize per-frame precision and temporal smoothnes...
The objective of this work is human pose estimation in videos, where multiple frames are available. ...
The power of ConvNets has been demonstrated in a wide variety of vision tasks including pose estimat...
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human ske...
Estimating 3D poses from a monocular video is still a challenging task, despite the significant prog...
Estimating human poses from videos is critical in human-computer interaction. By precisely estimatin...
This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can...
International audienceMost state-of-the-art methods for action recognition rely on a two-stream arch...
Though continuous advances in the field of human pose estimation, it remains a challenge to retrieve...
Predicting 3D human pose from a single monoscopic video can be highly challenging due to factors suc...
We address the problem of articulated human pose es-timation in videos using an ensemble of tractabl...
We present a bundle-adjustment-based algorithm for recovering accurate 3D human pose and meshes from...
Recording real life human motion as a skinned mesh animation with an acceptable quality is usually d...
We present a novel method for learning human motion models from unsegmented videos. We propose a uni...
Human motion prediction, i.e., forecasting future body poses given observed pose sequence, has typic...
The objective of this work is human pose estimation in videos, where multiple frames are available. ...
The objective of this work is human pose estimation in videos, where multiple frames are available. ...
The power of ConvNets has been demonstrated in a wide variety of vision tasks including pose estimat...
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human ske...
Estimating 3D poses from a monocular video is still a challenging task, despite the significant prog...
Estimating human poses from videos is critical in human-computer interaction. By precisely estimatin...
This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can...
International audienceMost state-of-the-art methods for action recognition rely on a two-stream arch...
Though continuous advances in the field of human pose estimation, it remains a challenge to retrieve...
Predicting 3D human pose from a single monoscopic video can be highly challenging due to factors suc...
We address the problem of articulated human pose es-timation in videos using an ensemble of tractabl...
We present a bundle-adjustment-based algorithm for recovering accurate 3D human pose and meshes from...
Recording real life human motion as a skinned mesh animation with an acceptable quality is usually d...
We present a novel method for learning human motion models from unsegmented videos. We propose a uni...
Human motion prediction, i.e., forecasting future body poses given observed pose sequence, has typic...
The objective of this work is human pose estimation in videos, where multiple frames are available. ...
The objective of this work is human pose estimation in videos, where multiple frames are available. ...
The power of ConvNets has been demonstrated in a wide variety of vision tasks including pose estimat...
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human ske...