Skip to main content

Table 3 32 shards: correlation between semantic and syntactic distance metrics

From: Selecting a representative decision tree from an ensemble of decision-tree models for fast big data classification

ID

UCI dataset name

\(R^2\) (J48)

\(R^2\) (CART)

DS1

Poker Hand

0.0243

0.2039

DS2

SUSY

0.0594

0.0015

DS3

RLCP

0.2318

0.7709

DS4

KDD Cup

0.1555

0.5613

DS5

Household Electric

0.1023

0.4741

DS6

HIGGS

0.0938

0.3617