Table 4 Baseline results for four algorithms used in this study

From: Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media

| Algorithm | N-gram | Feature selection | F1 score (%) |
|---|---|---|---|
| LinearSVC | 1 | Term frequency | 84.0 |
| Logistic regression | 1, 2 | TF-IDF | 84.0 |
| Multinomial NB | 1, 2 | TF-IDF | 86.0 |
| KNN | 1, 2 | TF-IDF | 77.6 |
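
To make the table's feature settings concrete, here is a minimal sketch of what an n-gram setting of "1" versus "1, 2" and a weighting of term frequency versus TF-IDF could look like. It is an illustration under stated assumptions, not the authors' implementation: the helper names (`ngrams`, `term_frequency`, `tf_idf`), the whitespace tokenizer, the toy documents, and the smoothed IDF formula are all hypothetical choices, since the table does not specify the exact weighting or pre-processing used.

```python
import math
from collections import Counter


def ngrams(tokens, sizes):
    """Return all n-grams of the given sizes as space-joined strings."""
    grams = []
    for n in sizes:
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


def term_frequency(docs, sizes):
    """Raw n-gram counts per document (whitespace tokenization is an assumption)."""
    return [Counter(ngrams(doc.split(), sizes)) for doc in docs]


def tf_idf(docs, sizes):
    """Weight raw counts by a smoothed inverse document frequency."""
    counts = term_frequency(docs, sizes)
    n_docs = len(docs)
    # Document frequency: the number of documents containing each n-gram.
    doc_freq = Counter(term for c in counts for term in c)
    # Assumed smoothed IDF: ln((1 + N) / (1 + df)) + 1; no length normalization.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    return [{t: tf * idf[t] for t, tf in c.items()} for c in counts]


# Hypothetical toy documents, not taken from the study's dataset.
docs = ["honey cures flu", "flu vaccine is safe", "honey is safe"]

tf_unigrams = term_frequency(docs, sizes=(1,))  # N-gram "1", term frequency
tfidf_uni_bi = tf_idf(docs, sizes=(1, 2))       # N-gram "1, 2", TF-IDF
```

With the "1, 2" setting, bigrams such as "honey cures" become features alongside the unigrams, and TF-IDF gives rarer terms (here "cures", which occurs in one document) more weight than terms shared across documents (here "honey").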