Table 4 Workload application characteristics

From: Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models

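| Workloads | Stages | Parallel stages | Collect | Serialization | Deserialization | Shuffle | Aggregate |
|-----------|--------|-----------------|---------|---------------|-----------------|---------|-----------|
| WC        | 2      | No              | Yes     | –             | –               | Yes     | –         |
| SVM       | 209    | No              | Yes     | No            | Yes             | Yes     | Yes       |
| Nweight   | 9      | Yes             | –       | No            | Yes             | Yes     | –         |
| kmeans    | 20     | No              | Yes     | Yes           | Yes             | Yes     | –         |
| Pagerank  | 5      | No              | –       | No            | Yes             | Yes     | –         |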