Skip to main content

Table 9 Summary of twitter sentiment analysis training corpus and Naija-tweet dataset

From: Real-time event detection in social media streams through semantic analysis of noisy terms

S/N

Dataset

Source(s)

Total

Selected

Training/Testing

1

Twitter sentiment analysis training corpus

1. University of

Michigan Sentiment Analysis on Kaggle

2. Twitter sentiment corpus by Niek

Sanders

1,578,627

1,048,575

(after

download)

104,857

(10%)

83,886/20,971

2

Naija-Tweets

Extracted from

Nigeria origin

12,920

12,920

(100%)

10,336/2,584