Fig. 7
From: Experimenting sensitivity-based anonymization framework in apache spark

Comparison in process-time between Pig and Scala scripts with 6 workers: This is the second experiment where 6 workers and one master are used. The experiment showed a good performance for Spark, when data size is smaller than 20 GB. Pig gains better performance when data size increases dramatically in comparison with the available memory