From: Clustering categorical data based on the relational analysis approach and MapReduce
Data set
Size
Number of attributes
Number of classes
Missing values
Soybean
47
35
4
No
Zoo
101
17
7
Mushroom
8124
22
2
Yes