
The Stereo Human Pose Estimation Dataset

SHPED contains annotated poses in stereo videos


We provide a dataset of stereo image pairs suited to upper-body human pose estimation in stereo. SHPED consists of 630 stereo image pairs (i.e. 1260 images) grouped into 42 video clips of 15 frames each. The clips were extracted from 26 stereo videos obtained from YouTube with the tag yt3d:enable = true. In addition, SHPED contains 1470 stickman upper-body annotations for 49 persons who satisfy the following conditions: upright position, nearly all upper-body parts visible, and a non-profile view of the body. Each clip also includes a plane projective transformation for rectification, as well as detections (bounding boxes) of each person throughout the sequence. The stereo image pairs vary widely in appearance, clothing, human pose, illumination, image quality, baseline separation of the cameras, and background.
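The figures in the description can be cross-checked with a few lines of arithmetic, and the per-clip plane projective transformation can be pictured as a 3x3 homography applied to pixel coordinates. The sketch below is illustrative only: the reading that each person is annotated once per frame in each view is an assumption that happens to reproduce the stated total of 1470, and `apply_homography`, `H_example`, and the keypoint values are hypothetical names and data, not part of SHPED or its file format.

```python
import numpy as np

# Figures quoted in the dataset description.
NUM_CLIPS = 42
FRAMES_PER_CLIP = 15
NUM_PERSONS = 49
VIEWS_PER_PAIR = 2  # left and right image of a stereo pair

stereo_pairs = NUM_CLIPS * FRAMES_PER_CLIP
images = stereo_pairs * VIEWS_PER_PAIR
# Assumption: one stickman annotation per person, per frame, per view.
annotations = NUM_PERSONS * FRAMES_PER_CLIP * VIEWS_PER_PAIR


def apply_homography(H, points):
    """Map an (N, 2) array of pixel coordinates through a 3x3 homography."""
    points = np.asarray(points, dtype=float)
    # Append a 1 to each point to work in homogeneous coordinates.
    homogeneous = np.hstack([points, np.ones((len(points), 1))])
    mapped = homogeneous @ np.asarray(H, dtype=float).T
    # Divide by the third coordinate to return to pixel coordinates.
    return mapped[:, :2] / mapped[:, 2:3]


# Hypothetical homography: a pure 3-pixel vertical shift, for illustration only.
H_example = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 3.0],
                      [0.0, 0.0, 1.0]])
keypoints = np.array([[100.0, 50.0], [120.0, 80.0]])
rectified = apply_homography(H_example, keypoints)

print(stereo_pairs, images, annotations)
print(rectified)
```

In practice the transformation supplied with each clip would replace `H_example`, and the same mapping would be applied to annotated keypoints or bounding-box corners so that they stay aligned with the rectified images.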

Task: Human Pose Estimation
Annotation Types: Keypoint Skeleton
Items: 15
Classes: 42
Labels: 1260
Last updated on: October 31, 2023
Licensed under: Research Only