Table 4 Random Forest accuracy for various encoding techniques, reproduced from [62]

From: Survey on categorical data for neural networks

| Encoding scheme    | Dimensionality | Training time (s) | Average training score | Score StDev |
|--------------------|----------------|-------------------|------------------------|-------------|
| BackwardDifference | 81             | 9.445193          | 0.961925               | 0.002291    |
| BinaryEncoder      | 13             | 9.234833          | 0.962050               | 0.002472    |
| HashingEncoder     | 8              | 20.524086         | 0.918650               | 0.002197    |
| HelmertEncoder     | 81             | 9.418384          | 0.962100               | 0.002359    |
| OnehotEncoder      | 84             | 8.884236          | 0.961950               | 0.002361    |
| OrdinalEncoder     | 3              | 8.443738          | 0.961950               | 0.002513    |
| SumEncoder         | 81             | 9.405340          | 0.961975               | 0.002560    |
| PolynomialEncoder  | 81             | 9.642599          | 0.962000               | 0.002327    |
| BaseNEncoder       | 13             | 10.734352         | 0.961925               | 0.002342    |
| LeaveOneOutEncoder | 3              | 8.746265          | 0.962150               | 0.002444    |
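
The Dimensionality column differs so widely because each scheme emits a different number of output columns per categorical feature. The following is a minimal sketch of that arithmetic, not the benchmark code behind the table. The function name `encoded_width`, the scheme labels, and the cardinalities `[5, 12, 30]` are all hypothetical, and the binary width assumes codes run from 0 to k-1 (real libraries may reserve extra codes, for example for unknown values).

```python
def encoded_width(n_categories: int, scheme: str) -> int:
    """Output columns for ONE categorical feature with n_categories levels.

    Illustrative only: hypothetical helper, not a library API.
    """
    if n_categories < 1:
        raise ValueError("n_categories must be at least 1")
    if scheme == "ordinal":
        # One integer code per feature.
        return 1
    if scheme == "onehot":
        # One indicator column per level.
        return n_categories
    if scheme == "contrast":
        # Contrast codings (Helmert, sum, backward difference, polynomial)
        # use k - 1 columns for k levels.
        return n_categories - 1
    if scheme == "binary":
        # Bits needed to write codes 0 .. k-1 in base 2 (assumption).
        return max(1, (n_categories - 1).bit_length())
    raise ValueError(f"unknown scheme: {scheme!r}")


# Hypothetical cardinalities for three categorical features.
cardinalities = [5, 12, 30]

totals = {
    scheme: sum(encoded_width(k, scheme) for k in cardinalities)
    for scheme in ("ordinal", "onehot", "contrast", "binary")
}
print(totals)
```

With three features, the contrast total is always three less than the one-hot total, which is consistent with the 84 versus 81 gap in the table if the underlying dataset has three categorical columns, as the OrdinalEncoder row suggests.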