Skip to main content

Table 5 The basic information of the 4 data sets

From: Optimal instance subset selection from big data using genetic algorithm and open source framework

Data sets

Number of instances

Number of attributes

Number of classes

Shuttle

58,000

9

7

Poker

1,000,000

10

10

CovType

581,012

54

7

Skin

245,057

3

2