Fig. 9From: Survey of transformers and towards ensemble learning using transformers for natural language processingROC curve for the Albert modelBack to article page