Skip to main content

Table 5 SSB execution times (in seconds): bucketing by “Od_Year” sorted by “P_Brand”

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

  SF = 30 SF = 100 SF = 30 SF = 100
HIVE PRESTO
DT DT-B DT DT-B DT DT-B DT DT-B
Q1.1 24 18 29 21 5 3 13 8
Q1.2 24 19 29 21 5 3 14 9
Q1.3 23 18 30 22 5 3 14 8
Q2.1 25 18 36 20 4 2 10 4
Q2.2 36 18 73 20 4 2 10 3
Q2.3 25 18 35 16 4 2 10 4
Q3.1 28 26 40 39 5 5 12 14
Q3.2 28 25 41 40 5 5 12 13
Q3.3 25 23 38 32 4 4 9 11
Q3.4 25 26 38 39 5 5 12 13
Q4.1 27 22 41 30 6 3 14 9
Q4.2 29 20 42 23 6 2 14 5
Q4.3 29 14 42 15 5 2 12 4
Total 349 265 516 337 63 41 155 103
Diff.   − 24%   − 35%   − 35%   − 34%
  1. Italic values indicate the fastest processing time by query, workload, tool and data model, also pointing the queries that include the sorted attribute in the “group by” and “order by” clauses