Skip to main content

Table 1 BERT architectures

From: Enhancing argumentation component classification using contextual language model

BERT model

Encoders

Attention heads

Hidden layers

BERT Base

12

12

768

DistilBERT Base

6

12

768