Skip to main content

Table 16 Best results by bucketing configuration and by SF

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

SF

Data model

Without data organization strategies

Bucketing

Orderkey

Od_Year (sorted by P_Brand)

Suppkey

Orderdate, Custkey, Suppkey, Partkey

30

SS

92 s

133 s

120 s

121 s

44%

30%

32%

DT

63 s

71 s

41 s

14%

− 35%

100

SS

262 s

305 s

321 s

305 s

16%

22%

16%

DT

155 s

178 s

103 s

15%

− 34%

300

SS

733 s

768 s

876 s

5%

19%

DT

472 s

  1. Italic values indicate the fastest processing time by SF, data model and configuration