From: Survey of transformers and towards ensemble learning using transformers for natural language processing
Model
BERT
GPT2
XLNet
RoBERTa
ALBERT
Acc
0.76434
0.64691
0.68825
0.71695
0.76645
F1
0.77443
0.65966
0.70266
0.72899
0.77559
P
0.77851
0.67552
0.70387
0.73279
0.78785
R
0.77297
0.65249
0.70604
0.72802
0.76654