Skip to main content

Table 6 SSB execution times (in seconds): bucketing by “Suppkey”

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

 

SF = 30

SF = 100

SF = 300

SF = 30

SF = 100

SF = 300

HIVE

PRESTO

SS

SS-B

SS

SS-B

SS

SS-B

SS

SS-B

SS

SS-B

SS

SS-B

Q1.1

25

22

31

29

44

46

5

6

13

16

36

36

Q1.2

24

23

29

29

42

44

5

7

13

16

34

36

Q1.3

24

23

29

30

43

45

4

7

13

14

35

34

Q2.1

32

31

47

53

531

110

8

11

19

25

59

62

Q2.2

31

29

46

66

531

611

7

9

18

22

51

56

Q2.3

30

29

44

49

531

101

7

9

17

22

49

53

Q3.1

35

33

59

68

651

137

8

11

29

35

81

83

Q3.2

30

28

45

52

677

92

6

8

17

23

51

53

Q3.3

33

33

219

45

665

80

5

7

15

17

43

44

Q3.4

34

30

222

44

675

78

6

6

15

19

43

43

Q4.1

38

39

86

88

226

237

13

17

43

51

119

127

Q4.2

49

49

70

65

141

119

9

11

26

32

69

75

Q4.3

34

35

54

57

116

103

8

10

23

28

63

67

Total

420

404

982

676

4874

1803

92

120

262

321

733

768

Diff.

 

− 4%

 

− 31%

 

− 63%

 

30%

 

22%

 

5%

  1. Italic values indicate the fastest processing time by query, workload, tool and data model