From: Real-time event detection in social media streams through semantic analysis of noisy terms
S/N | Dataset | Source(s) | Total | Selected | Training/Testing |
---|---|---|---|---|---|
1 | Twitter sentiment analysis training corpus | 1. University of Michigan Sentiment Analysis on Kaggle 2. Twitter sentiment corpus by Niek Sanders | 1,578,627 1,048,575 (after download) | 104,857 (10%) | 83,886/20,971 |
2 | Naija-Tweets | Extracted from Nigeria origin | 12,920 | 12,920 (100%) | 10,336/2,584 |