Skip to main content

Table 3 Neighborhood size (α) estimated from the data and from the clustering result after T Iterations

From: Efficiency of random swap clustering

Dataset

Full data

From clustering

Estimated iterations (T)

Initial T = 0

Early T = 5

Final T = 5000

q = 10%

q = 1%

q = 0.1%

Bridge

69.8

8.7

5.4

4.6

33,595

67,910

100,785

House

15.4

6.7

8.3

8.2

13,381

26,761

40,142

Miss America

346

34.2

17.1

11.9

3593

7078

10,617

Europe

(5.0)

4.8

6.3

6.3

26,699

53,398

80,098

BIRCH 1

5.0

4.5

5.8

5.6

2908

5815

8723

BIRCH 2

(4.7)

3.1

3.1

2.9

10,524

21,048

31,572

BIRCH 3

(4.9)

4.1

4.9

5.0

4508

9016

13,523

S 1

4.8

3.7

4.1

4.2

46

92

137

S 2

4.9

3.7

4.5

4.7

37

73

110

S 3

4.9

3.9

4.4

4.3

38

77

115

S 4

4.9

3.9

4.8

5.0

32

64

97

Unbalance

3.4

2.3

2.3

2.0

56

111

167

Dim-32

26.8

1.5

1.1

1.0

920

1839

2759

Dim-64

37.1

1.9

1.1

1.0

920

1839

2759

Dim-128

47.3

1.4

1.0

1.0

1135

2271

3406

KDD04-Bio

286.2

33.3

30.4

72,800

145,600

218,401

  1. Estimated number of iterations (T) for selected values of q are calculated as T = − ln q ln w (k/α)2