A collection of Human interactions with accompanying skeleton metadata.
This data may only be used for research purposes. If you use this data in your work, please contact the authors.
There are 4 files that make up the ShakeFive2 dataset:
- ShakeFive2.background.tar.bz2 - 2 background images for background subtraction (2MB)
- ShakeFive2.metadata.tar.bz2 - 153 xml metadata files (1 per video) (59.5MB)
- ShakeFive2.videos.tar - 153 mp4 videos encoded with x264 (104.7MB)
- ShakeFive2.code.tar.gz - C/C++ code for skeleton rendering (14.4kB)
The 153 videos are encoded with ffmpeg x264 at a resolution of 1280x720.
The metadata is frame based xml data containing the skeleton joints of the actors involved in the scene. The data was collected using a Kinect2 sensor. A Skeleton rendering program (C/C++) is available with this dataset.
Spatio-Temporal Detection of Fine-Grained Dyadic Human Interactions (pdf)
Coert van Gemeren, Ronald Poppe and Remco C. Veltkamp
7th International Workshop on Human Behavior Understanding 2016. pg.116--133.