From: Survey of transformers and towards ensemble learning using transformers for natural language processing
Model
BERT
XLNet
GPT2
RoBERTa
ALBERT
Weight
0.316
0.111
0.362
0.265
0.417