Fig. 1
From: HybSMRP: a hybrid scheduling algorithm in Hadoop MapReduce framework
Processing phases in MapReduce [12]. Mappers run on unsorted input key/value pairs. Each mapper emits zero, one, or multiple output key/value pairs for each input key/value pair. The shuffle and sort phase is performed by the framework: data from all mappers are grouped by key, split among reducers, and sorted by key. Each reducer obtains all values associated with the same key.
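
The phases described in the caption can be illustrated with a small in-process sketch. This is not the Hadoop API and not the article's code: the names `mapper`, `shuffle_and_sort`, `reducer`, and `run_job` are hypothetical, word counting is chosen only as a familiar example, and the byte-sum partitioner is an illustrative stand-in for whatever partitioning the framework actually applies.

```python
from collections import defaultdict


def mapper(_key, line):
    """Map phase: emit zero, one, or many (word, 1) pairs per input record."""
    for word in line.split():
        yield word.lower(), 1


def shuffle_and_sort(mapped_pairs, num_reducers):
    """Shuffle and sort phase: group by key, split among reducers, sort by key.

    In a real framework this step is done for you; it is written out here
    only to make the data movement visible.
    """
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for key, value in mapped_pairs:
        # Assumed partitioner: a deterministic byte sum modulo the reducer
        # count, so that every occurrence of a key lands on the same reducer.
        index = sum(key.encode("utf-8")) % num_reducers
        partitions[index][key].append(value)
    # Each reducer receives its keys in sorted order.
    return [sorted(partition.items()) for partition in partitions]


def reducer(key, values):
    """Reduce phase: receives all values associated with one key."""
    return key, sum(values)


def run_job(records, num_reducers=2):
    """Run map, shuffle/sort, and reduce over (key, value) input records."""
    mapped = [pair for key, value in records for pair in mapper(key, value)]
    partitions = shuffle_and_sort(mapped, num_reducers)
    return [[reducer(key, values) for key, values in part] for part in partitions]


if __name__ == "__main__":
    # Input keys are line offsets; input values are lines of text.
    for reducer_id, output in enumerate(run_job([(0, "a b a"), (1, "b c")])):
        print(reducer_id, output)
```

The result is one output list per reducer. Because partitioning depends only on the key, no key is ever split across two reducers, which is what allows each reducer to see every value for the keys it owns.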