From: The effects of class rarity on the evaluation of supervised healthcare fraud detection models
 | Dataset | Features | Non-fraud | Fraud | % Fraud |
---|---|---|---|---|---|
Train | Part B | 126 | 3,691,146 | 1409 | 0.038 |
Part D | 126 | 2,098,715 | 1018 | 0.048 | |
DMEPOS | 145 | 862,792 | 635 | 0.074 | |
Combined | 173 | 759,267 | 473 | 0.062 | |
Test | Part B | 126 | 999,815 | 99 | 0.010 |
Part D | 123 | 744,918 | 135 | 0.018 | |
DMEPOS | 119 | 290,548 | 75 | 0.026 | |
Combined | 171 | 256,529 | 55 | 0.021 |