From: Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models
Workloads
Stages
Parallel stages
Collect
Serialization
Deserialization
Shuffle
Aggregate
WC
2
No
Yes
–
SVM
209
Nweight
9
kmeans
20
Pagerank
5