Skip to main content

Table 5 The best execution time of MapReduce and Spark with WordCount workload

From: A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench

 

Split sizes (MB)

Execution time (s)

MapReduce input splits (WordCount)

128

2376

Spark input splits (WordCount)

256

1392

MapReduce shuffle (WordCount)

100

2371

Spark shuffle (WordCount)

300

1334