From: The effects of class rarity on the evaluation of supervised healthcare fraud detection models
Term | Df | Sum Sq | Mean Sq | F value | Pr(>F) |
---|---|---|---|---|---|
(a) Four factor: Train_Test | |||||
Datasets | 3 | 1.96516 | 0.65505 | 1600.558663 | 0 |
Learner | 2 | 0.66409 | 0.33205 | 811.3215437 | 1.29E−270 |
Pos count | 4 | 2.33669 | 0.58417 | 1427.367028 | 0 |
Ratio | 5 | 0.30092 | 0.06018 | 147.0525251 | 1.17E−136 |
Dataset:learner | 6 | 0.22145 | 0.03691 | 90.18221558 | 2.05E−102 |
Dataset:pos count | 7 | 0.05788 | 0.00827 | 20.2044874 | 1.61E−26 |
Learner:pos count | 8 | 0.03291 | 0.00411 | 10.0523189 | 7.04E−14 |
Dataset:ratio | 15 | 0.08724 | 0.00582 | 14.21108373 | 2.17E−35 |
Learner:ratio | 10 | 0.46537 | 0.04654 | 113.7087576 | 2.69E−194 |
Pos count:ratio | 20 | 0.05443 | 0.00272 | 6.649302133 | 3.98E−18 |
Dataset:learner:pos count | 14 | 0.02401 | 0.00171 | 4.189763574 | 2.52E−07 |
Dataset:learner:ratio | 30 | 0.11577 | 0.00386 | 9.428942232 | 2.99E−40 |
Dataset:pos count:ratio | 35 | 0.01306 | 0.00037 | 0.911606914 | 0.61790442 |
Learner:pos count:ratio | 40 | 0.10682 | 0.00267 | 6.52539348 | 3.70E−32 |
Dataset:learner:pos count:ratio | 70 | 0.04795 | 0.00068 | 1.673573178 | 4.56E−04 |
Residuals | 2430 | 0.99452 | 0.00041 | – | – |
(b) Four factor: Train_CV | |||||
Datasets | 7 | 26.98719 | 3.85531 | 2641.872038 | 0 |
Learner | 2 | 1.34356 | 0.67178 | 460.3389767 | 5.33E−194 |
Pos count | 3 | 6.30292 | 2.10097 | 1439.702651 | 0 |
Ratio | 5 | 1.25714 | 0.25143 | 172.2919289 | 3.65E−178 |
Dataset:learner | 14 | 0.79698 | 0.05693 | 39.00963177 | 2.86E−105 |
Dataset:pos count | 4 | 0.34344 | 0.08586 | 58.83679308 | 2.58E−49 |
Learner:pos count | 6 | 0.07181 | 0.01197 | 8.201310311 | 7.04E−09 |
Dataset:ratio | 35 | 0.32743 | 0.00936 | 6.410645904 | 3.43E−29 |
Learner:ratio | 10 | 1.36440 | 0.13644 | 93.49620948 | 1.03E−187 |
Pos count:ratio | 15 | 0.13786 | 0.00919 | 6.298014086 | 1.64E−13 |
Dataset:learner:pos count | 8 | 0.04741 | 0.00593 | 4.061371638 | 7.71E−05 |
Dataset:learner:ratio | 70 | 0.42329 | 0.00605 | 4.143770726 | 3.52E−28 |
Dataset:pos count:ratio | 20 | 0.01735 | 0.00087 | 0.594299086 | 0.919852682 |
Learner:pos count:ratio | 30 | 0.08452 | 0.00282 | 1.9304991 | 0.001663456 |
Dataset:learner:pos count:ratio | 40 | 0.07660 | 0.00192 | 1.312296909 | 0.089594623 |
Residuals | 13230 | 19.30669 | 0.00146 | – | – |
(c) Two factor: Train_Test and Train_CV | |||||
Evaluation method | 1 | 0.45439 | 0.45439 | 103.9411081 | 2.59E−24 |
Residuals | 12598 | 55.07319 | 0.00437 | – | – |