
Table 2 Descriptive and inference statistics of storage space usage (Patient History)

From: An empirical comparison of the performances of single structure columnar in-memory and disk-resident data storage techniques using healthcare big data

| (A) Attribute | (B) Size in Disk DB (Items) | (C) Size in In-Memory (Items) | (D) Compression Factor (B/C) | (E) % Space Saving, (1 - (C/B)) * 100 |
|---|---|---|---|---|
| Pat-No | 1,003,164 | 6,182 | 162.3 | 99.3837 |
| Surname | 1,003,164 | 6,121 | 163.9 | 99.3898 |
| Other-name | 1,003,164 | 112,279 | 8.9 | 88.8075 |
| Sex | 1,003,164 | 2 | 501,582.0 | 99.9998 |
| Age | 1,003,164 | 112 | 8,956.8 | 99.9888 |
| Doa | 1,003,164 | 3,701 | 271.1 | 99.6311 |
| Source | 1,003,164 | 2 | 501,582.0 | 99.9998 |
| Provider | 1,003,164 | 239 | 4,197.3 | 99.9762 |
| Dod | 1,003,164 | 3,290 | 304.9 | 99.6720 |
| Sod | 1,003,164 | 2 | 501,582.0 | 99.9998 |
| Condition | 1,003,164 | 2 | 501,582.0 | 99.9998 |
| Diagnosis | 1,003,164 | 8 | 125,395.5 | 99.9992 |
| Dictionary Encoding | | 1,003,164 | | |
| TOTAL | 12,037,968.00 | 1,135,104.00 | | |
| MEAN | 1,003,164.00 | 87,315.69 | | |
| STD DEV | 0 | 319,224.11 | | |

MEAN SPACE SAVING % = (1 - (TOTAL C / TOTAL B)) * 100 = (1 - (1,135,104 / 12,037,968)) * 100 = 90.57%
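
The derived columns and the overall figure can be recomputed from the item counts alone. The sketch below is illustrative and not taken from the paper: the names `ROWS`, `DICTIONARY_ENCODING_ITEMS`, `compression_factor`, and `space_saving_pct` are hypothetical, and it assumes that the Dictionary Encoding entry (1,003,164 items) counts only toward the in-memory total, which is the reading under which the column sums match the TOTAL row.

```python
# Item counts copied from Table 2: attribute -> (disk-resident items, in-memory items).
ROWS = {
    "Pat-No": (1_003_164, 6_182),
    "Surname": (1_003_164, 6_121),
    "Other-name": (1_003_164, 112_279),
    "Sex": (1_003_164, 2),
    "Age": (1_003_164, 112),
    "Doa": (1_003_164, 3_701),
    "Source": (1_003_164, 2),
    "Provider": (1_003_164, 239),
    "Dod": (1_003_164, 3_290),
    "Sod": (1_003_164, 2),
    "Condition": (1_003_164, 2),
    "Diagnosis": (1_003_164, 8),
}

# Assumption: the Dictionary Encoding entry is added to the in-memory side only.
DICTIONARY_ENCODING_ITEMS = 1_003_164


def compression_factor(disk_items: int, memory_items: int) -> float:
    """Column (D): B / C."""
    return disk_items / memory_items


def space_saving_pct(disk_items: int, memory_items: int) -> float:
    """Column (E): (1 - C / B) * 100."""
    return (1 - memory_items / disk_items) * 100


total_disk = sum(b for b, _ in ROWS.values())
total_memory = sum(c for _, c in ROWS.values()) + DICTIONARY_ENCODING_ITEMS
overall_saving = space_saving_pct(total_disk, total_memory)

if __name__ == "__main__":
    for name, (b, c) in ROWS.items():
        print(f"{name:<12}{compression_factor(b, c):>12,.1f}{space_saving_pct(b, c):>10.4f}")
    print(f"TOTAL B = {total_disk:,}  TOTAL C = {total_memory:,}")
    print(f"Mean space saving = {overall_saving:.2f}%")
```

Under that assumption the totals come out to 12,037,968 and 1,135,104, and the overall saving rounds to 90.57%, matching the table. Note that this overall figure is the saving on the totals, not the average of the per-attribute percentages in column (E).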