From: Clustering categorical data based on the relational analysis approach and MapReduce
Distribution is given in columns \(C_1\) to \(C_7\).

Cluster | Size | \(C_1\) | \(C_2\) | \(C_3\) | \(C_4\) | \(C_5\) | \(C_6\) | \(C_7\) | Purity |
---|---|---|---|---|---|---|---|---|---|
1 | 42 | 41 | 0 | 1 | 0 | 0 | 0 | 0 | 0.98 |
2 | 5 | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0.80 |
3 | 17 | 0 | 16 | 0 | 0 | 0 | 1 | 0 | 0.94 |
4 | 17 | 0 | 0 | 3 | 13 | 1 | 0 | 0 | 0.76 |
5 | 3 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 1 |
6 | 5 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 1 |
7 | 12 | 0 | 0 | 0 | 0 | 0 | 2 | 10 | 0.83 |
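
The excerpt does not state how purity is computed. The sketch below assumes the common definition, in which a cluster's purity is its largest entry among \(C_1\) to \(C_7\) divided by the cluster size; under that assumption the computed values agree with the Purity column to two decimals. The names `distribution`, `cluster_purity`, and `overall_purity` are illustrative and do not come from the source, and the size-weighted overall purity is an additional quantity that the table itself does not report.

```python
# Counts copied from the table: cluster id -> objects falling in C_1..C_7.
distribution = {
    1: [41, 0, 1, 0, 0, 0, 0],
    2: [0, 4, 1, 0, 0, 0, 0],
    3: [0, 16, 0, 0, 0, 1, 0],
    4: [0, 0, 3, 13, 1, 0, 0],
    5: [0, 0, 0, 0, 3, 0, 0],
    6: [0, 0, 0, 0, 0, 5, 0],
    7: [0, 0, 0, 0, 0, 2, 10],
}


def cluster_purity(counts):
    """Assumed definition: share of the dominant C_j within one cluster."""
    return max(counts) / sum(counts)


def overall_purity(table):
    """Size-weighted purity over all clusters (not reported in the table)."""
    total = sum(sum(counts) for counts in table.values())
    dominant = sum(max(counts) for counts in table.values())
    return dominant / total


# Print cluster id, size, and purity so they can be compared with the table.
for cluster_id, counts in distribution.items():
    print(cluster_id, sum(counts), f"{cluster_purity(counts):.2f}")

print("overall", f"{overall_purity(distribution):.2f}")
```

Keeping the raw counts rather than the rounded purities makes it possible to check each row: the Size column should equal the row sum, and the Purity column should equal the row maximum divided by that sum.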