Skip to main content
Fig. 21 | Journal of Big Data

Fig. 21

From: Cumulative deviation of a subpopulation from the full population

Fig. 21

San Joaquin County, reporting the number of related children in the household, with scores being \(\log _{10}\) of the adjusted household income; \(n =\) 2282; Kuiper’s statistic is \(0.2449 / \sigma = 9.120\), Kolmogorov’s and Smirnov’s is \(0.2429 / \sigma = 9.045\). The lack of deviation between the subpopulation and the full population right near scores of 4.0 is difficult to discern in the reliability diagrams with 10 or 20 bins each. The reliability diagrams with around 100 bins each do display the lack of deviation near scores of 4.0, but the rest of these diagrams is really noisy. The cumulative plot nicely captures the lack of deviation near scores of 4.0. Overall, the scalar summary statistics detect highly statistically significant deviation

Back to article page