Skip to main content

Table 7 Dataset lines no classification

From: Addressing big data variety using an automated approach for data characterization

Number of files

NCDC

CDC

ODS

 

# of files

% of Data set

# of files

% of Data set

# of files

% of Data set

0–100

2,742

95

84

15

3,737

63

101–500

129

4

102

18

330

 

501–10,000

18

1

246

44

1,592

27

10,001–100,000

  

68

12

185

3

100,001–10,000,000

  

59

11

74

1