Skip to main content

Table 4 Clustering (1.9 GB) raster image with a 2 node Hadoop cluster

From: Multi-dimensional geospatial data mining in a distributed environment using MapReduce

Block size (MB)

Elapsed time (min)

Average map time (min)

Average shuffle time (min)

Average merge time (min)

Average reduce time (min)

Total mapping time (min)

16

16.3

13.1

2.3

0.1

0.2

12.9

32

13.0

10.0

2.1

0.1

0.2

11.6

64

11.5

9.3

4.3

0.2

0.1

10.5

96

11.8

9.7

6.6

0.2

0.2

10.9

128

14.1

11.8

12.2

0.2

0.1

10.8

256

14.9

11.6

13.5

0.1

2.9

14.2