Table 4. Workload application characteristics

From: Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models

Workloads Stages Parallel stages Collect Serialization Deserialization Shuffle Aggregate
WC 2 No Yes Yes
SVM 209 No Yes No Yes Yes Yes
Nweight 9 Yes No Yes Yes
K-means 20 No Yes Yes Yes Yes
PageRank 5 No No Yes Yes