Table 6 Clustering a 17.79 GB raster image using a 2-node Hadoop cluster

From: Multi-dimensional geospatial data mining in a distributed environment using MapReduce

| Block size (MB) | Elapsed time (min) | Average map time (min) | Average shuffle time (min) | Average merge time (min) | Average reduce time (min) | Total mapping time (min) |
|---|---|---|---|---|---|---|
| 128 | 99.472 | 61.95 | 22.262 | 0.274 | 1.96 | 88.426 |
| 265 | 85.7 | 69.7 | 14.1 | 0.3 | 1.5 | 74.2 |
| 512 | 74.0 | 58.6 | 10.3 | 0.3 | 1.8 | 63.9 |
| 1024 | 114.2 | 95.3 | 103.3 | 0.3 | 1.7 | 104.1 |
| 18 GB | 146.3 | 114.1 | 136.5 | 0.0 | 1.3 | 137.0 |