Skip to main content

Table 1 Description of real-life data sets

From: Clustering categorical data based on the relational analysis approach and MapReduce

Data set

Size

Number of attributes

Number of classes

Missing values

Soybean

47

35

4

No

Zoo

101

17

7

No

Mushroom

8124

22

2

Yes