Table 3. Neighborhood size (α) estimated from the data and from the clustering result after T iterations

From: Efficiency of random swap clustering

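| Dataset | α from full data | α from clustering, initial (T = 0) | α from clustering, early (T = 5) | α from clustering, final (T = 5000) | Estimated T, q = 10% | Estimated T, q = 1% | Estimated T, q = 0.1% |
|---|---|---|---|---|---|---|---|
| Bridge | 69.8 | 8.7 | 5.4 | 4.6 | 33,595 | 67,910 | 100,785 |
| House | 15.4 | 6.7 | 8.3 | 8.2 | 13,381 | 26,761 | 40,142 |
| Miss America | 346 | 34.2 | 17.1 | 11.9 | 3593 | 7078 | 10,617 |
| Europe | (5.0) | 4.8 | 6.3 | 6.3 | 26,699 | 53,398 | 80,098 |
| BIRCH 1 | 5.0 | 4.5 | 5.8 | 5.6 | 2908 | 5815 | 8723 |
| BIRCH 2 | (4.7) | 3.1 | 3.1 | 2.9 | 10,524 | 21,048 | 31,572 |
| BIRCH 3 | (4.9) | 4.1 | 4.9 | 5.0 | 4508 | 9016 | 13,523 |
| S 1 | 4.8 | 3.7 | 4.1 | 4.2 | 46 | 92 | 137 |
| S 2 | 4.9 | 3.7 | 4.5 | 4.7 | 37 | 73 | 110 |
| S 3 | 4.9 | 3.9 | 4.4 | 4.3 | 38 | 77 | 115 |
| S 4 | 4.9 | 3.9 | 4.8 | 5.0 | 32 | 64 | 97 |
| Unbalance | 3.4 | 2.3 | 2.3 | 2.0 | 56 | 111 | 167 |
| Dim-32 | 26.8 | 1.5 | 1.1 | 1.0 | 920 | 1839 | 2759 |
| Dim-64 | 37.1 | 1.9 | 1.1 | 1.0 | 920 | 1839 | 2759 |
| Dim-128 | 47.3 | 1.4 | 1.0 | 1.0 | 1135 | 2271 | 3406 |
| KDD04-Bio | 286.2 | 33.3 | 30.4 | | 72,800 | 145,600 | 218,401 |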
  1. The estimated numbers of iterations (T) for the selected values of q are calculated as $T = -\ln q \cdot \ln w \cdot (k/\alpha)^2$
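
As a minimal sketch, the footnote formula can be evaluated directly, reading it as the product T = −ln q · ln w · (k/α)². The function name `estimated_iterations` and the input values below are hypothetical and chosen only for illustration: the table does not list k or w for each dataset, so those would have to be taken from the article body.

```python
import math


def estimated_iterations(q: float, w: float, k: int, alpha: float) -> float:
    """Evaluate T = -ln(q) * ln(w) * (k / alpha)**2 from the table footnote.

    The meaning of each symbol follows the article; this sketch only checks
    that the inputs keep the logarithms defined and T non-negative.
    """
    if not 0.0 < q < 1.0:
        raise ValueError("q must lie strictly between 0 and 1")
    if w < 1.0:
        raise ValueError("w must be at least 1 so that ln(w) is non-negative")
    if alpha <= 0.0:
        raise ValueError("alpha must be positive")
    return -math.log(q) * math.log(w) * (k / alpha) ** 2


if __name__ == "__main__":
    # Hypothetical inputs (assumptions, not values stated in the table).
    k, alpha, w = 15, 4.2, 5
    for q in (0.10, 0.01, 0.001):
        t = estimated_iterations(q, w, k, alpha)
        print(f"q = {q:.1%}: T is about {t:.0f}")
```

Because T is proportional to −ln q, the values for q = 1% and q = 0.1% are two and three times the value for q = 10%, which is roughly the pattern seen in most rows of the table (for example BIRCH 1: 2908, 5815, 8723).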