Skip to main content

Table 9 SSB execution times (in seconds): partitioning by “Od_Year” and bucketing by “Orderkey” (star schema with partitions and buckets (SS-PB)).

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

 

SF = 30

SF = 100

SF = 300

SF = 30

SF = 100

SF = 300

HIVE

PRESTO

SS

SS-PB

SS

SS-PB

SS

SS-PB

SS

SS-PB

SS

SS-PB

SS

SS-PB

Q1.1

25

16

31

21

44

26

5

2

13

4

36

7

Q1.2

24

23

29

32

42

42

5

6

13

13

34

44

Q1.3

24

18

29

21

43

25

4

2

13

4

35

8

Q2.1

32

33

47

60

531

682

8

11

19

26

59

98

Q2.2

31

32

46

53

531

677

7

10

18

23

51

76

Q2.3

30

30

44

52

531

670

7

9

17

22

49

74

Q3.1

35

31

59

56

651

667

8

10

29

29

81

100

Q3.2

30

28

45

50

677

634

6

7

17

19

51

63

Q3.3

33

33

219

78

665

648

5

6

15

16

43

53

Q3.4

34

31

222

228

675

674

6

7

15

19

43

59

Q4.1

38

39

86

102

226

253

13

17

43

50

119

164

Q4.2

49

35

70

63

141

91

9

7

26

18

69

48

Q4.3

34

28

54

50

116

77

8

6

23

14

63

38

Total

420

378

982

865

4874

5166

92

100

262

256

733

835

Diff

 

− 10%

 

− 12%

 

6%

 

8%

 

− 2%

 

14%

  1. Italic values indicate the fastest processing time by query, workload, tool and data model