Skip to main content
Fig. 1 | Journal of Big Data

Fig. 1

From: HybSMRP: a hybrid scheduling algorithm in Hadoop MapReduce framework

Fig. 1

Processing phases in MapReduce [12]. Mappers run on unsorted input key/value pairs. Each mapper emits zero, one, or multiple output key/value pairs for each input key/value pairs. The shuffle and sort phase is done by the framework. Data from all mappers are grouped by the key, split among reducers and sorted by the key. Each reducer obtains all values associated with the same key

Back to article page