Skip to main content

Table 18 Best configuration and processing time by SF

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

SF Partitioning Bucketing Partitioning and bucketing Configuration
(best scenario)
SS DT SS DT SS DT
30     41 s    Bucketing by “Od_Year” (Sorted by “P_Brand”)
100   71 s      Multiple Partitioning by “Od_Year” and “S_Region”
300   299 s      Multiple Partitioning by “Od_Year” and “S_Region”