From: Addressing big data variety using an automated approach for data characterization
Number of files | NCDC | CDC | ODS | |||
---|---|---|---|---|---|---|
# of files | % of Data set | # of files | % of Data set | # of files | % of Data set | |
0–100 | 2,742 | 95 | 84 | 15 | 3,737 | 63 |
101–500 | 129 | 4 | 102 | 18 | 330 | |
501–10,000 | 18 | 1 | 246 | 44 | 1,592 | 27 |
10,001–100,000 | 68 | 12 | 185 | 3 | ||
100,001–10,000,000 | 59 | 11 | 74 | 1 |