Skip to main content

Advertisement

Table 4 Clustering (1.9 GB) raster image with a 2 node Hadoop cluster

From: Multi-dimensional geospatial data mining in a distributed environment using MapReduce

Block size (MB) Elapsed time (min) Average map time (min) Average shuffle time (min) Average merge time (min) Average reduce time (min) Total mapping time (min)
16 16.3 13.1 2.3 0.1 0.2 12.9
32 13.0 10.0 2.1 0.1 0.2 11.6
64 11.5 9.3 4.3 0.2 0.1 10.5
96 11.8 9.7 6.6 0.2 0.2 10.9
128 14.1 11.8 12.2 0.2 0.1 10.8
256 14.9 11.6 13.5 0.1 2.9 14.2