
Table 15 Best results by multiple partitioning configuration and by SF

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

SF | Data model | Without data organization strategies | Multiple partitioning (Od_Year, S_Region) | Multiple partitioning (S_Region, S_Nation, S_City)
30 | SS | 92 s | 63 s (−32%) | 70 s (−24%)
30 | DT | 63 s | 43 s (−32%)
100 | SS | 262 s | 149 s (−43%) | 193 s (−26%)
100 | DT | 155 s | 71 s (−54%)
300 | SS | 733 s | 399 s (−46%)
300 | DT | 472 s | 299 s (−37%)
  1. Italic values indicate the fastest processing time by SF, data model and configuration
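
The percentages in the table are consistent with the relative change in processing time against the run without data organization strategies, rounded to the nearest whole percent. A minimal sketch of that arithmetic follows; the helper name `percent_change` and the rounding rule are assumptions for illustration, not taken from the article.

```python
def percent_change(baseline_s, partitioned_s):
    """Relative change in processing time versus the baseline, in whole percent.

    Assumption: the result is rounded to the nearest integer.
    """
    return round((partitioned_s - baseline_s) / baseline_s * 100)


# (SF, data model, baseline seconds, partitioned seconds), values from Table 15
rows = [
    (30, "SS", 92, 63),
    (30, "SS", 92, 70),
    (30, "DT", 63, 43),
    (100, "SS", 262, 149),
    (100, "SS", 262, 193),
    (100, "DT", 155, 71),
    (300, "SS", 733, 399),
    (300, "DT", 472, 299),
]

changes = [percent_change(baseline, partitioned) for _, _, baseline, partitioned in rows]

for (sf, model, baseline, partitioned), change in zip(rows, changes):
    print(f"SF {sf} {model}: {baseline} s -> {partitioned} s ({change}%)")
```

A negative value means the partitioned run was faster than the baseline.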