From: Severely imbalanced Big Data challenges: investigating data sampling approaches

(a) AUC

| Learner | Sampling | AUC | std | r | g | Min | Max | Q25 | Q50 | Q75 |
|---|---|---|---|---|---|---|---|---|---|---|
| GBT | RUS | 0.94364 | 0.08565 | 50 | a | 0.56736 | 0.97822 | 0.96356 | 0.96704 | 0.97073 |
| GBT | None | 0.68678 | 0.11066 | 10 | b | 0.48887 | 0.85775 | 0.64388 | 0.67656 | 0.74637 |
| GBT | SMOTEb2 | 0.66636 | 0.15255 | 50 | b | 0.35152 | 0.97593 | 0.58214 | 0.67336 | 0.75517 |
| GBT | SMOTEb1 | 0.65517 | 0.15317 | 50 | b | 0.35417 | 0.96539 | 0.52471 | 0.67769 | 0.75147 |
| GBT | SMOTE | 0.63836 | 0.17643 | 50 | b | 0.43299 | 0.98249 | 0.45329 | 0.65340 | 0.68739 |
| GBT | ADASYN | 0.63792 | 0.27363 | 50 | b | 0.18138 | 0.98483 | 0.45072 | 0.47832 | 0.96347 |
| GBT | ROS | 0.63578 | 0.16737 | 50 | b | 0.43522 | 0.98169 | 0.45196 | 0.65518 | 0.68476 |
| LR | RUS | 0.85968 | 0.15064 | 50 | a | 0.46331 | 0.98434 | 0.74674 | 0.92661 | 0.96904 |
| LR | SMOTE | 0.62164 | 0.04412 | 50 | b | 0.47496 | 0.67054 | 0.59907 | 0.63122 | 0.65456 |
| LR | ROS | 0.61643 | 0.04297 | 50 | b | 0.49468 | 0.67039 | 0.59864 | 0.60161 | 0.65380 |
| LR | SMOTEb1 | 0.59235 | 0.00155 | 50 | b | 0.58955 | 0.59388 | 0.59043 | 0.59325 | 0.59336 |
| LR | None | 0.59203 | 0.00181 | 10 | bc | 0.58977 | 0.59365 | 0.59001 | 0.59324 | 0.59347 |
| LR | SMOTEb2 | 0.59201 | 0.00166 | 50 | bc | 0.58950 | 0.59382 | 0.58991 | 0.59314 | 0.59329 |
| LR | ADASYN | 0.52089 | 0.13781 | 50 | c | 0.42229 | 0.89815 | 0.43665 | 0.45708 | 0.49882 |
| RF | SMOTE | 0.91978 | 0.02812 | 50 | a | 0.85957 | 0.96023 | 0.90119 | 0.91684 | 0.94535 |
| RF | ROS | 0.91933 | 0.02724 | 50 | a | 0.86641 | 0.96169 | 0.90193 | 0.91198 | 0.94159 |
| RF | RUS | 0.91893 | 0.03494 | 50 | a | 0.85684 | 0.96880 | 0.89140 | 0.91016 | 0.95740 |
| RF | SMOTEb2 | 0.87345 | 0.01368 | 50 | b | 0.85486 | 0.90771 | 0.86248 | 0.87021 | 0.88223 |
| RF | SMOTEb1 | 0.87003 | 0.02088 | 50 | b | 0.79088 | 0.91191 | 0.85906 | 0.86786 | 0.88263 |
| RF | None | 0.86773 | 0.00890 | 10 | b | 0.85090 | 0.88340 | 0.86338 | 0.86753 | 0.87333 |
| RF | ADASYN | 0.56017 | 0.15056 | 50 | c | 0.33384 | 0.87529 | 0.45101 | 0.51294 | 0.67599 |


(b) GM

| Learner | Sampling | GM | std | r | g | Min | Max | Q25 | Q50 | Q75 |
|---|---|---|---|---|---|---|---|---|---|---|
| GBT | RUS | 0.86382 | 0.18048 | 50 | a | 0.24356 | 0.97083 | 0.79393 | 0.94445 | 0.97021 |
| GBT | SMOTE | 0.48838 | 0.16479 | 50 | b | 0.08179 | 0.64737 | 0.33031 | 0.60297 | 0.60528 |
| GBT | ROS | 0.47668 | 0.13952 | 50 | b | 0.23670 | 0.64740 | 0.33031 | 0.50990 | 0.60513 |
| GBT | SMOTEb1 | 0.29887 | 0.11561 | 50 | c | 0.23672 | 0.56217 | 0.24196 | 0.24197 | 0.24369 |
| GBT | SMOTEb2 | 0.27735 | 0.08665 | 50 | c | 0.23672 | 0.56291 | 0.24196 | 0.24197 | 0.24369 |
| GBT | None | 0.25168 | 0.10408 | 10 | c | 0.08180 | 0.51158 | 0.23672 | 0.24196 | 0.24326 |
| GBT | ADASYN | 0.22760 | 0.11021 | 50 | c | 0.05009 | 0.33034 | 0.07652 | 0.24369 | 0.33026 |
| LR | RUS | 0.85284 | 0.15640 | 50 | a | 0.38133 | 0.97066 | 0.72452 | 0.91562 | 0.97001 |
| LR | ROS | 0.70219 | 0.07309 | 50 | b | 0.44269 | 0.72367 | 0.72336 | 0.72338 | 0.72338 |
| LR | SMOTE | 0.69947 | 0.08142 | 50 | b | 0.38133 | 0.72455 | 0.72336 | 0.72338 | 0.72365 |
| LR | None | 0.64486 | 0.00017 | 10 | bc | 0.64473 | 0.64506 | 0.64473 | 0.64473 | 0.64506 |
| LR | SMOTEb1 | 0.64299 | 0.00804 | 50 | bc | 0.60116 | 0.64570 | 0.64473 | 0.64473 | 0.64506 |
| LR | SMOTEb2 | 0.63823 | 0.01460 | 50 | c | 0.58532 | 0.64506 | 0.64473 | 0.64473 | 0.64473 |
| LR | ADASYN | 0.47147 | 0.15695 | 50 | d | 0.37992 | 0.91572 | 0.38026 | 0.38128 | 0.46906 |
| RF | RUS | 0.45341 | 0.34982 | 50 | a | 0 | 0.96438 | 0 | 0.63766 | 0.64780 |
| RF | SMOTE | 0.43540 | 0.25934 | 50 | a | 0 | 0.71944 | 0.37773 | 0.63642 | 0.64454 |
| RF | ROS | 0.42211 | 0.24530 | 50 | a | 0 | 0.72048 | 0.37757 | 0.38138 | 0.64454 |