Skip to main content

Table 16 Best results by bucketing configuration and by SF

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

SF Data model Without data organization strategies Bucketing
Orderkey Od_Year (sorted by P_Brand) Suppkey Orderdate, Custkey, Suppkey, Partkey
30 SS 92 s 133 s 120 s 121 s
44% 30% 32%
DT 63 s 71 s 41 s
14% − 35%
100 SS 262 s 305 s 321 s 305 s
16% 22% 16%
DT 155 s 178 s 103 s
15% − 34%
300 SS 733 s 768 s 876 s
5% 19%
DT 472 s
  1. Italic values indicate the fastest processing time by SF, data model and configuration