Table 7 Proportions of errors, outliers, missing and valid values in V38

From: Exploring and cleaning big data with random sample data blocks

| Category | True value | RSP-based mean ± StdDev | RSP-based 5th percentile | RSP-based 50th percentile | RSP-based 95th percentile |
| --- | --- | --- | --- | --- | --- |
| Errors | 0.00000154 | 0.00001408 ± 0.00001408 | 0 | 0 | 0 |
| Outliers | 0.03630323 | 0.03663272 ± 0.002324408 | 0.03328592 | 0.03656265 | 0.04053885 |
| Missing values | 0.53230704 | 0.5320224 ± 0.005352080 | 0.52315101 | 0.53212264 | 0.54086298 |
| Valid values | 0.43138819 | 0.4313435 ± 0.005839722 | 0.42173133 | 0.43105753 | 0.44055828 |
  1. RSP-based proportions were calculated from a sample of \(g=100\) RSP blocks.
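
A summary of this kind could be computed roughly as follows. This is only a minimal sketch, not the authors' code: the category labels, the block size of 1,000, the category weights, and the helper names `block_proportions`, `percentile` and `summarize` are all hypothetical, and the data are synthetic rather than taken from V38. It also assumes the sample standard deviation and linearly interpolated percentiles, which the table does not specify.

```python
import random
import statistics

# Hypothetical category labels for the values of one variable.
CATEGORIES = ("error", "outlier", "missing", "valid")


def block_proportions(labels):
    """Return the share of each category within one block of labels."""
    n = len(labels)
    return {c: sum(1 for x in labels if x == c) / n for c in CATEGORIES}


def percentile(values, q):
    """Percentile (q in [0, 100]) by linear interpolation over sorted values."""
    s = sorted(values)
    pos = (len(s) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)


def summarize(blocks):
    """Summarize per-block proportions across a sample of blocks."""
    per_block = [block_proportions(b) for b in blocks]
    summary = {}
    for c in CATEGORIES:
        vals = [p[c] for p in per_block]
        summary[c] = {
            "mean": statistics.mean(vals),
            "stdev": statistics.stdev(vals),  # sample StdDev (an assumption)
            "p5": percentile(vals, 5),
            "p50": percentile(vals, 50),
            "p95": percentile(vals, 95),
        }
    return summary


# Synthetic stand-in data: g = 100 blocks of 1,000 labels each.
rng = random.Random(0)
weights = [0.001, 0.04, 0.53, 0.429]  # made-up mix, not the V38 values
blocks = [rng.choices(CATEGORIES, weights=weights, k=1000) for _ in range(100)]

summary = summarize(blocks)
for c in CATEGORIES:
    s = summary[c]
    print(f"{c:8s} {s['mean']:.4f} ± {s['stdev']:.4f}  "
          f"{s['p5']:.4f} {s['p50']:.4f} {s['p95']:.4f}")
```

Each row of the printed output corresponds to one category row of the table, with the per-block proportions playing the role of the RSP-based estimates.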