Skip to main content

Table 15 Best results by multiple partitioning configuration and by SF

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

SF

Data model

Without data organization strategies

Multiple partitioning

Od_Year, S_Region

S_Region, S_Nation, S_City

30

SS

92 s

63 s

70 s

− 32%

− 24%

DT

63 s

43 s

− 32%

100

SS

262 s

149 s

193 s

− 43%

− 26%

DT

155 s

71 s

− 54%

300

SS

733 s

399 s

− 46%

DT

472 s

299 s

− 37%

  1. Italic values indicate the fastest processing time by SF, data model and configuration