Table 1 Descriptive and inference statistics of storage space usage (NHIS)

From: An empirical comparison of the performances of single structure columnar in-memory and disk-resident data storage techniques using healthcare big data

| (A) Attribute | (B) Size in disk DB (items) | (C) Size in in-memory DB (items) | (D) Compression factor (B/C) | (E) Space saving (%) = (1 - (C/B)) * 100 |
|---|---|---|---|---|
| Identifier | 1,056,132 | 9 | 117,348 | 99.9991 |
| Firstname | 1,056,132 | 123,753 | 8.5342 | 88.2824 |
| Relationship | 1,056,132 | 6 | 176,022 | 99.9994 |
| Sex | 1,056,132 | 2 | 528,066 | 99.9998 |
| Surname | 1,056,132 | 89,592 | 11.7882 | 91.517 |
| Type | 1,056,132 | 6 | 176,022 | 99.9994 |
| Date of birth | 1,056,132 | 22,037 | 47.9254 | 97.9134 |
| Dictionary Encoding | | 1,056,132 | | |
| TOTAL | 7,392,924 | 1,291,537 | | |

MEAN: (B) = 1,056,132; (C) = 161,442.125

STD DEV: (B) = 0; (C) = 364,639.4471

MEAN SPACE SAVING % = (1 - (TOTAL C / TOTAL B)) * 100 = (1 - (1,291,537 / 7,392,924)) * 100 = 82.53%
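The derived columns and summary figures can be recomputed from the raw item counts. The following is a minimal Python sketch, not the authors' code. The names `ROWS`, `DICTIONARY_ENCODING_ITEMS`, `compression_factor`, and `space_saving_pct` are hypothetical. It assumes that the Dictionary Encoding count of 1,056,132 belongs to column (C), which matches the printed total of 1,291,537, and that the reported standard deviation is the sample (n - 1) standard deviation over the eight column (C) entries.

```python
from statistics import mean, stdev

# Item counts copied from Table 1: attribute -> (B: disk DB, C: in-memory).
ROWS = {
    "Identifier": (1_056_132, 9),
    "Firstname": (1_056_132, 123_753),
    "Relationship": (1_056_132, 6),
    "Sex": (1_056_132, 2),
    "Surname": (1_056_132, 89_592),
    "Type": (1_056_132, 6),
    "Date of birth": (1_056_132, 22_037),
}

# Assumption: the Dictionary Encoding row contributes to column (C) only.
DICTIONARY_ENCODING_ITEMS = 1_056_132


def compression_factor(b: int, c: int) -> float:
    """Column (D): B / C."""
    return b / c


def space_saving_pct(b: int, c: int) -> float:
    """Column (E): (1 - C/B) * 100."""
    return (1 - c / b) * 100


# Per-attribute derived columns.
for name, (b, c) in ROWS.items():
    d = compression_factor(b, c)
    e = space_saving_pct(b, c)
    print(f"{name:<14} D = {d:>12,.4f}   E = {e:.4f}")

# Totals and summary statistics.
total_b = sum(b for b, _ in ROWS.values())
in_memory = [c for _, c in ROWS.values()] + [DICTIONARY_ENCODING_ITEMS]
total_c = sum(in_memory)

mean_c = mean(in_memory)
std_c = stdev(in_memory)  # sample standard deviation (n - 1)
mean_saving = space_saving_pct(total_b, total_c)

print(f"TOTAL B = {total_b:,}   TOTAL C = {total_c:,}")
print(f"MEAN C = {mean_c:,.3f}   STD DEV C = {std_c:,.4f}")
print(f"MEAN SPACE SAVING % = {mean_saving:.2f}")
```

Under these assumptions the sketch reproduces the totals, the mean and standard deviation of column (C), and the overall space saving of 82.53%.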