Egocentric 3D human pose estimation (HPE) from images is challenging due to severe self-occlusions and strong distortion introduced by the fish-eye view from the head mounted camera. Although existing works use intermediate heatmap-based representations to counter distortion with some success, addressing self-occlusion remains an open problem. In this work, we leverage information from past frames to guide our self-attention-based 3D HPE estimation procedure -- Ego-STAN. Specifically, we build a spatio-temporal Transformer model that attends to semantically rich convolutional neural network-based feature maps. We also propose feature map tokens: a new set of learnable parameters to attend to these feature maps. Finally, we demonstrate Ego-S...
In this paper, an automatic approach for 3D pose reconstruction from a single image is proposed. The...
There has been a recent surge of interest in introducing transformers to 3D human pose estimation (H...
This paper introduces a new architecture for human pose estimation using a multi-layer convolutional...
We present a solution to egocentric 3D body pose estimation from monocular images captured from down...
Egocentric vision is an emerging topic, which has demonstrated great potential in assistive healthca...
Egocentric 3D human pose estimation with a single head-mounted fisheye camera has recently attracted...
Estimating 3D human motion from an egocentric video sequence is critical to human behavior understan...
We present a new solution to egocentric 3D body pose estimation from monocular images captured from ...
We propose a method for object-aware 3D egocentric pose estimation that tightly integrates kinematic...
2022 Summer.Includes bibliographical references.Recent head-mounted virtual reality (VR) devices inc...
In this work the inherently ambiguous task of predicting 3D human poses from monocular RGB images is...
The neural network based approach for 3D human pose estimation from monocular images has attracted g...
We present UnrealEgo, i.e., a new large-scale naturalistic dataset for egocentric 3D human pose esti...
Recent advancements in computer vision have seen a rise in the prominence of applications using neur...
Human detection and pose estimation are essential components for any artificial system responsive to...
In this paper, an automatic approach for 3D pose reconstruction from a single image is proposed. The...
There has been a recent surge of interest in introducing transformers to 3D human pose estimation (H...
This paper introduces a new architecture for human pose estimation using a multi-layer convolutional...
We present a solution to egocentric 3D body pose estimation from monocular images captured from down...
Egocentric vision is an emerging topic, which has demonstrated great potential in assistive healthca...
Egocentric 3D human pose estimation with a single head-mounted fisheye camera has recently attracted...
Estimating 3D human motion from an egocentric video sequence is critical to human behavior understan...
We present a new solution to egocentric 3D body pose estimation from monocular images captured from ...
We propose a method for object-aware 3D egocentric pose estimation that tightly integrates kinematic...
2022 Summer.Includes bibliographical references.Recent head-mounted virtual reality (VR) devices inc...
In this work the inherently ambiguous task of predicting 3D human poses from monocular RGB images is...
The neural network based approach for 3D human pose estimation from monocular images has attracted g...
We present UnrealEgo, i.e., a new large-scale naturalistic dataset for egocentric 3D human pose esti...
Recent advancements in computer vision have seen a rise in the prominence of applications using neur...
Human detection and pose estimation are essential components for any artificial system responsive to...
In this paper, an automatic approach for 3D pose reconstruction from a single image is proposed. The...
There has been a recent surge of interest in introducing transformers to 3D human pose estimation (H...
This paper introduces a new architecture for human pose estimation using a multi-layer convolutional...