Table 1 Description of real-life data sets

From: Clustering categorical data based on the relational analysis approach and MapReduce

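| Data set | Size | Number of attributes | Number of classes | Missing values |
|----------|------|----------------------|-------------------|----------------|
| Soybean  | 47   | 35                   | 4                 | No             |
| Zoo      | 101  | 17                   | 7                 | No             |
| Mushroom | 8124 | 22                   | 2                 | Yes            |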