From: Survey of transformers and towards ensemble learning using transformers for natural language processing
Model
BERT
XLNet
GPT2
RoBERTa
ALBERT
Text Generation
0.31578
0.11133
0.36234
0.41700
0.26518