Skip to main content

Table 2 SSB execution times (in seconds): partitioning by “S_Region”, “S_Nation” and “S_City”

From: Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

 

SF = 30

SF = 100

SF = 30

SF = 100

HIVE

PRESTO

SS

SS-P

SS

SS-P

SS

SS-P

SS

SS-P

Q1.1

25

32

31

41

5

11

13

36

Q1.2

24

31

29

44

5

11

13

37

Q1.3

24

30

29

43

4

11

13

35

Q2.1

32

30

47

46

8

4

19

10

Q2.2

31

29

46

43

7

4

18

10

Q2.3

30

30

44

45

7

4

17

9

Q3.1

35

28

59

38

8

4

29

11

Q3.2

30

24

45

28

6

2

17

4

Q3.3

33

27

219

35

5

2

15

3

Q3.4

34

26

222

33

6

2

15

3

Q4.1

38

37

86

79

13

6

43

16

Q4.2

49

42

70

63

9

5

26

14

Q4.3

34

30

54

45

8

3

23

5

Total

420

396

982

582

92

70

262

193

Diff.

 

− 6%

 

− 41%

 

− 24%

 

− 26%