From: Survey of transformers and towards ensemble learning using transformers for natural language processing
Model
BERT
XLNet
RoBERTa
ALBERT
Acc
0.935
0.954
0.884
0.952
F1
0.986
0.988
0.918
0.990