From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems
SF | Data model | Without data organization strategies | Bucketing | |||
---|---|---|---|---|---|---|
Orderkey | Od_Year (sorted by P_Brand) | Suppkey | Orderdate, Custkey, Suppkey, Partkey | |||
30 | SS | 92 s | 133 s | – | 120 s | 121 s |
44% | – | 30% | 32% | |||
DT | 63 s | 71 s | 41 s | – | – | |
14% | − 35% | – | – | |||
100 | SS | 262 s | 305 s | – | 321 s | 305 s |
16% | – | 22% | 16% | |||
DT | 155 s | 178 s | 103 s | – | – | |
15% | − 34% | – | – | |||
300 | SS | 733 s | – | – | 768 s | 876 s |
– | – | 5% | 19% | |||
DT | 472 s | – | – | – | – | |
– | – | – | – |