We propose a method for object-aware 3D egocentric pose estimation that tightly integrates kinematics modeling, dynamics modeling, and scene object information. Unlike prior kinematics or dynamics-based approaches where the two components are used disjointly, we synergize the two approaches via dynamics-regulated training. At each timestep, a kinematic model is used to provide a target pose using video evidence and simulation state. Then, a prelearned dynamics model attempts to mimic the kinematic pose in a physics simulator. By comparing the pose instructed by the kinematic model against the pose generated by the dynamics model, we can use their misalignment to further improve the kinematic model. By factoring in the 6DoF pose of objects (...
Estimating human motion from video is an active research area due to its many potential applications...
This thesis deals with object pose estimation and tracking, and solve robot manipulation tasks. It a...
We introduce a data-driven method to generate a large number of plausible, closely interacting 3D hu...
Estimating 3D human motion from an egocentric video sequence is critical to human behavior understan...
We propose embodied scene-aware human pose estimation where we estimate 3D poses based on a simulate...
We present UnrealEgo, i.e., a new large-scale naturalistic dataset for egocentric 3D human pose esti...
In this thesis, we argue that the 3D scene is vital for understanding, reconstructing, and synthesiz...
Egocentric 3D human pose estimation (HPE) from images is challenging due to severe self-occlusions a...
Egocentric 3D human pose estimation with a single head-mounted fisheye camera has recently attracted...
We introduce a novel task of reconstructing a time series of second-person 3D human body meshes from...
We present a new method for generating controllable, dynamically responsive, and photorealistic huma...
3D human pose estimation from a monocular video has recently seen significant improvements. However,...
We propose a technique for learning single-view 3D object pose estimation models by utilizing a new ...
Human motion capture either requires multi-camera systems or is unreliable using single-view input d...
To produce safe human motions, assistive wearable exoskeletons must be equipped with a perception sy...
Estimating human motion from video is an active research area due to its many potential applications...
This thesis deals with object pose estimation and tracking, and solve robot manipulation tasks. It a...
We introduce a data-driven method to generate a large number of plausible, closely interacting 3D hu...
Estimating 3D human motion from an egocentric video sequence is critical to human behavior understan...
We propose embodied scene-aware human pose estimation where we estimate 3D poses based on a simulate...
We present UnrealEgo, i.e., a new large-scale naturalistic dataset for egocentric 3D human pose esti...
In this thesis, we argue that the 3D scene is vital for understanding, reconstructing, and synthesiz...
Egocentric 3D human pose estimation (HPE) from images is challenging due to severe self-occlusions a...
Egocentric 3D human pose estimation with a single head-mounted fisheye camera has recently attracted...
We introduce a novel task of reconstructing a time series of second-person 3D human body meshes from...
We present a new method for generating controllable, dynamically responsive, and photorealistic huma...
3D human pose estimation from a monocular video has recently seen significant improvements. However,...
We propose a technique for learning single-view 3D object pose estimation models by utilizing a new ...
Human motion capture either requires multi-camera systems or is unreliable using single-view input d...
To produce safe human motions, assistive wearable exoskeletons must be equipped with a perception sy...
Estimating human motion from video is an active research area due to its many potential applications...
This thesis deals with object pose estimation and tracking, and solve robot manipulation tasks. It a...
We introduce a data-driven method to generate a large number of plausible, closely interacting 3D hu...