From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

Results are grouped under three strategies (Partitioning, Bucketing, Partitioning and bucketing), each with SS and DT sub-columns. Each row reports a single time and the configuration of the best scenario.

SF | Time | Configuration (best scenario)
---|---|---
30 | 41 s | Bucketing by “Od_Year” (Sorted by “P_Brand”)
100 | 71 s | Multiple Partitioning by “Od_Year” and “S_Region”
300 | 299 s | Multiple Partitioning by “Od_Year” and “S_Region”
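
The two configurations named in the table could be declared in HiveQL roughly as follows. This is a minimal sketch, not the authors' actual DDL: the table names (`lineorder_part`, `lineorder_buck`), the non-key columns, the column types, the bucket count of 8, and the ORC storage format are all assumptions made for illustration. Only the attribute names `Od_Year`, `S_Region`, and `P_Brand` come from the table.

```sql
-- Hypothetical table: multiple partitioning by Od_Year and S_Region.
-- Partition columns are declared in PARTITIONED BY, not in the column list.
CREATE TABLE lineorder_part (
  lo_revenue BIGINT,   -- assumed measure column
  p_brand    STRING
)
PARTITIONED BY (od_year INT, s_region STRING)
STORED AS ORC;         -- storage format is an assumption

-- Hypothetical table: bucketing by Od_Year, rows sorted by P_Brand in each bucket.
-- Bucketing and sort columns must appear in the regular column list.
CREATE TABLE lineorder_buck (
  lo_revenue BIGINT,   -- assumed measure column
  od_year    INT,
  s_region   STRING,
  p_brand    STRING
)
CLUSTERED BY (od_year) SORTED BY (p_brand ASC) INTO 8 BUCKETS  -- bucket count is an assumption
STORED AS ORC;
```

With partitioning, each distinct (`od_year`, `s_region`) pair is stored in its own directory, so a query filtering on those attributes can skip the other directories. With bucketing, rows are hashed on `od_year` into a fixed number of files, and the sort on `p_brand` orders the rows within each file.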