From: Intelligent video surveillance: a review through deep learning techniques for crowd analysis
Dataset | Type/purpose | Model/schema used |
---|---|---|
ImageNet2012 | Images | Â |
PASCAL VOC | Images | Â |
Frames Labeled In Cinema (FLIC) | Popular holywood movies | Â |
Leeds Sports Pose (LSP) | Sports people gathered from FLICKR | Â |
CAVIAR | Used for event detection of surveillance domain | Threshold Model used for spatio temporal motion analysis and Bag of Actions for reducing search space [1] |
BEHAVE | Used for event detection of surveillance domain | Threshold Model used for spatio temporal motion analysis and Bag of Actions for reducing search space [1] |
YTO | Videos collected from YouTube | Â |
i-LIDS sterile zone | People detection | Intrusion detection system with global features [91] |
PETS 2001 | Images | Intrusion detection system with global features [91] |
MoSIFT | Movie dataset | Â |
STIP | Hockey dataset | Â |
MediaEval 2013 dataset | Collection of movies | Â |
UCSD pedestrian | Pedestrian walkway | Convolutional auto-encoder model [12] |