From: Annotating and detecting topics in social media forum and modelling the annotation to derive directions-a case study
| Measure | Before normalizing and cleaning | After normalizing and cleaning |
| --- | --- | --- |
| Total no. of tokens in the forum | 114,345 | 41,279 |
| Total no. of unique tokens | 24,933 | 18,184 |
| Lexical richness (unique tokens / total tokens) | 24,933 / 114,345 = 0.218 | 18,184 / 41,279 = 0.440 |
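
The lexical richness figures above are the ratio of unique tokens to total tokens. A minimal sketch of that arithmetic follows. The function names `lexical_richness` and `richness_of_tokens` are hypothetical, and the whitespace tokenization in the last lines is an assumption made only for illustration, since the source does not state how the forum text was tokenized.

```python
from typing import Iterable


def lexical_richness(unique_tokens: int, total_tokens: int) -> float:
    """Return the ratio of unique tokens to total tokens."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return unique_tokens / total_tokens


def richness_of_tokens(tokens: Iterable[str]) -> float:
    """Compute the same ratio directly from a sequence of tokens."""
    token_list = list(tokens)
    return lexical_richness(len(set(token_list)), len(token_list))


# Counts taken from the table: before and after normalizing and cleaning.
before = lexical_richness(24933, 114345)  # about 0.2181
after = lexical_richness(18184, 41279)    # about 0.4405 (the table gives 0.440)

# Toy example with naive whitespace tokenization (an assumption):
# 5 tokens, 4 of them distinct.
toy = richness_of_tokens("the cat saw the dog".split())

print(f"before: {before:.4f}, after: {after:.4f}, toy: {toy:.2f}")
```

The ratio roughly doubles after cleaning because the total token count falls much more sharply (114,345 to 41,279) than the number of unique tokens (24,933 to 18,184).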