Table 1 Datasets used in our experiments

From: Meta-MapReduce for scalable data mining

| Data | No. of instances | No. of attributes | Size on disk |
|---|---|---|---|
| yeast | 892 | 8 | 34 KB |
| wineRed | 1599 | 12 | 84 KB |
| wineWhite | 4898 | 12 | 263 KB |
| pendigits | 7494 | 16 | 360 KB |
| spambase | 4601 | 57 | 687 KB |
| musk | 6598 | 167 | 4.2 MB |
| telescope | 19020 | 11 | 1.4 MB |
| kdd | 148517 | 42 | 21.2 MB |
| isolet | 7797 | 618 | 30.7 MB |
| org | 2059 | 9731 | 38.4 MB |
| census | 299285 | 42 | 129 MB |
| S1 | 100000 | 400 | 210 MB |
| S2 | 200000 | 400 | 420 MB |