TY - STD TI - Japkowicz N. Learning from imbalanced data sets: a comparison of various strategies. In: AAAI workshop on learning from imbalanced data sets, Vol. 68. 2000. p. 10–5. ID - ref1 ER - TY - JOUR AU - Batista, G. E. AU - Prati, R. C. AU - Monard, M. C. PY - 2004 DA - 2004// TI - A study of the behavior of several methods for balancing machine learning training data JO - ACM SIGKDD Explorations Newsl VL - 6 ID - Batista2004 ER - TY - STD TI - Shatnawi R. Improving software fault-prediction for imbalanced data. In: 2012 international conference on innovations in information technology (IIT); 2012. p. 54–9. ID - ref3 ER - TY - STD TI - Di Martino M, Decia F, Molinelli J, Fernández A. Improving electric fraud detection using class imbalance strategies. In: ICPRAM; 2012. p. 135–41. ID - ref4 ER - TY - JOUR AU - Majid, A. AU - Ali, S. AU - Iqbal, M. AU - Kausar, N. PY - 2014 DA - 2014// TI - Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines JO - Comput Methods Programs Biomed VL - 113 ID - Majid2014 ER - TY - JOUR AU - Liu, Y. AU - Loh, H. T. AU - Sun, A. PY - 2009 DA - 2009// TI - Imbalanced text classification: a term weighting approach JO - Expert Syst Appl VL - 36 ID - Liu2009 ER - TY - JOUR AU - Kubat, M. AU - Holte, R. C. AU - Matwin, S. PY - 1998 DA - 1998// TI - Machine learning for the detection of oil spills in satellite radar images JO - Mach Learn VL - 30 ID - Kubat1998 ER - TY - STD TI - Su P, Mao W, Zeng D, Li X, Wang FY. Handling class imbalance problem in cultural modeling. In: 2009 IEEE international conference on intelligence and security informatics; 2009. p. 251–6. ID - ref8 ER - TY - JOUR AU - Abdi, Y. AU - Parsa, S. AU - Seyfari, Y. PY - 2015 DA - 2015// TI - A hybrid one-class rule learning approach based on swarm intelligence for software fault prediction JO - Innovations Syst Softw Eng VL - 11 ID - Abdi2015 ER - TY - JOUR AU - Ganganwar, V. PY - 2012 DA - 2012// TI - An overview of classification algorithms for imbalanced datasets JO - Int J Emerg Technol Adv Eng VL - 2 ID - Ganganwar2012 ER - TY - JOUR AU - Kotsiantis, S. AU - Kanellopoulos, D. AU - Pintelas, P. PY - 2006 DA - 2006// TI - Handling imbalanced datasets: a review JO - GESTS Int Trans Computer Sci Eng VL - 30 ID - Kotsiantis2006 ER - TY - STD TI - Ferreira AJ, Figueiredo MA. Boosting algorithms: a review of methods, theory, and applications. In: Ensemble machine learning. Boston: Springer; 2012. p. 35–85. ID - ref12 ER - TY - JOUR AU - Wang, S. AU - Yao, X. PY - 2012 DA - 2012// TI - Multiclass imbalance problems: analysis and potential solutions JO - IEEE Trans Syst Man Cybern VL - 42 ID - Wang2012 ER - TY - JOUR AU - Bi, J. AU - Zhang, C. PY - 2018 DA - 2018// TI - An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme JO - Knowl-Based Syst VL - 15 ID - Bi2018 ER - TY - JOUR AU - Wu, K. AU - Zheng, Z. AU - Tang, S. PY - 2017 DA - 2017// TI - BVDT: A boosted vector decision tree algorithm for multi-class classification problems JO - Int J Pattern Recognit Artif Intell VL - 31 ID - Wu2017 ER - TY - JOUR AU - Leevy, J. L. AU - Khoshgoftaar, T. M. AU - Bauder, R. A. AU - Seliya, N. PY - 2018 DA - 2018// TI - A survey on addressing high-class imbalance in big data JO - J Big Data VL - 5 ID - Leevy2018 ER - TY - JOUR AU - Abu-Salih, B. AU - Chan, K. Y. AU - Al-Kadi, O. AU - Al-Tawil, M. AU - Wongthongtham, P. AU - Issa, T. AU - Saadeh, H. AU - Al-Hassan, M. AU - Bremie, B. AU - Albahlal, A. PY - 2020 DA - 2020// TI - Time-aware domain-based social influence prediction JO - J Big Data VL - 7 ID - Abu-Salih2020 ER - TY - STD TI - Sleeman IV WC, Krawczyk B. Bagging Using Instance-Level Difficulty for Multi-Class Imbalanced Big Data Classification on Spark. In2019 IEEE International Conference on Big Data (Big Data) 2019 (pp. 2484–2493). IEEE. ID - ref18 ER - TY - STD TI - Sun Y, Kamel MS, Wang Y. Boosting for learning multiple classes with imbalanced class distribution. In: Sixth international conference on data mining (ICDM'06); 2006. p. 592–602. ID - ref19 ER - TY - JOUR AU - Zhen, L. AU - Qiong, L. PY - 2012 DA - 2012// TI - A new feature selection method for internet traffic classification using ml JO - Phys Procedia VL - 1 ID - Zhen2012 ER - TY - JOUR AU - Ling, C. X. AU - Huang, J. AU - Zhang, H. PY - 2003 DA - 2003// TI - AUC: a statistically consistent and more discriminating measure than accuracy JO - Ijcai VL - 3 ID - Ling2003 ER - TY - JOUR AU - Huang, J. AU - Ling, C. X. PY - 2005 DA - 2005// TI - Using AUC and accuracy in evaluating learning algorithms JO - IEEE Trans Knowl Data Eng VL - 17 ID - Huang2005 ER - TY - JOUR AU - Singh, A. AU - Purohit, A. PY - 2015 DA - 2015// TI - A survey on methods for solving data imbalance problem for classification JO - Int J Computer Appl VL - 127 ID - Singh2015 ER - TY - JOUR AU - FernáNdez, A. AU - LóPez, V. AU - Galar, M. AU - Jesus, M. J. AU - Herrera, F. PY - 2013 DA - 2013// TI - Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches JO - Knowl-Based Syst VL - 1 ID - FernáNdez2013 ER - TY - JOUR AU - Krawczyk, B. PY - 2016 DA - 2016// TI - Learning from imbalanced data: open challenges and future directions JO - Prog Artif Intell VL - 5 ID - Krawczyk2016 ER - TY - JOUR AU - Tahir, M. A. AU - Asghar, S. AU - Manzoor, A. AU - Noor, M. A. PY - 2019 DA - 2019// TI - A classification model for class imbalance dataset using genetic programming JO - IEEE Access VL - 8 ID - Tahir2019 ER - TY - JOUR AU - Ramentol, E. AU - Caballero, Y. AU - Bello, R. AU - Herrera, F. PY - 2012 DA - 2012// TI - SMOTE-RSB*: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory JO - Knowl Inf Syst VL - 33 ID - Ramentol2012 ER - TY - STD TI - Liu A, Ghosh J, Martin CE. Generative oversampling for mining imbalanced datasets. In: DMIN; 2007. p. 66–72. ID - ref28 ER - TY - STD TI - Kumari C, Abulaish M, Subbarao N. Using SMOTE to deal with class-imbalance problem in bioactivity data to predict mTOR inhibitors. In: Proceedings of the international conference on adaptive computational intelligence (ICACI), Mysuru, India; 2019. p. 1–12. ID - ref29 ER - TY - JOUR AU - Colton, D. AU - Hofmann, M. PY - 2019 DA - 2019// TI - Sampling techniques to overcome class imbalance in a cyberbullying context JO - J Computer-Assist Linguistic Res VL - 3 ID - Colton2019 ER - TY - STD TI - Esteves VM. Techniques to deal with imbalanced data in multi-class problems: a review of existing methods. ID - ref31 ER - TY - JOUR AU - Ling, C. X. AU - Sheng, V. S. PY - 2008 DA - 2008// TI - Cost-sensitive learning and the class imbalance problem JO - Encyclopedia Mach Learn VL - 2011 ID - Ling2008 ER - TY - JOUR AU - Maheshwari, S. AU - Agrawal, J. AU - Sharma, S. PY - 2011 DA - 2011// TI - New approach for classification of highly imbalanced datasets using evolutionary algorithms JO - Int J Sci Eng Res VL - 2 ID - Maheshwari2011 ER - TY - JOUR AU - Błaszczyński, J. AU - Stefanowski, J. PY - 2015 DA - 2015// TI - Neighbourhood sampling in bagging for imbalanced data JO - Neurocomputing VL - 20 ID - Błaszczyński2015 ER - TY - JOUR AU - Rokach, L. PY - 2010 DA - 2010// TI - Ensemble-based classifiers JO - Artif Intell Rev VL - 33 ID - Rokach2010 ER - TY - JOUR AU - Schapire, R. E. PY - 1999 DA - 1999// TI - A brief introduction to boosting JO - Ijcai VL - 99 ID - Schapire1999 ER - TY - JOUR AU - Breiman, L. PY - 1996 DA - 1996// TI - Bagging predictors JO - Mach Learn VL - 24 ID - Breiman1996 ER - TY - JOUR AU - Galar, M. AU - Fernandez, A. AU - Barrenechea, E. AU - Bustince, H. AU - Herrera, F. PY - 2011 DA - 2011// TI - A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches JO - IEEE Trans Syst Man Cybern. VL - 42 ID - Galar2011 ER - TY - JOUR AU - Zhang, Z. AU - Krawczyk, B. AU - Garcìa, S. AU - Rosales-Pérez, A. AU - Herrera, F. PY - 2016 DA - 2016// TI - Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data JO - Knowl-Based Syst VL - 15 ID - Zhang2016 ER - TY - STD TI - Krawczyk B. Combining one-vs-one decomposition and ensemble learning for multi-class imbalanced data. In: Proceedings of the 9th international conference on computer recognition systems CORES 2015. Cham: Springer; 2016. p. 27–36. ID - ref40 ER - TY - JOUR AU - Feng, W. AU - Huang, W. AU - Ren, J. PY - 2018 DA - 2018// TI - Class imbalance ensemble learning based on the margin theory JO - Appl Sci VL - 8 ID - Feng2018 ER - TY - JOUR AU - Schapire, R. E. AU - Singer, Y. PY - 2000 DA - 2000// TI - BoosTexter: A boosting-based system for text categorization JO - Mach Learn VL - 39 ID - Schapire2000 ER - TY - STD TI - Freund Y, Schapire RE. A decision-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory. Heidelberg: Springer; 1995. p. 23–37. ID - ref43 ER - TY - JOUR AU - Hastie, T. AU - Rosset, S. AU - Zhu, J. AU - Zou, H. PY - 2009 DA - 2009// TI - Multi-class adaboost JO - Stat Interface VL - 2 ID - Hastie2009 ER - TY - JOUR AU - Friedman, J. AU - Hastie, T. AU - Tibshirani, R. PY - 2000 DA - 2000// TI - Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors) JO - Ann Stat VL - 28 ID - Friedman2000 ER - TY - JOUR AU - Sun, P. AU - Reid, M. D. AU - Zhou, J. PY - 2014 DA - 2014// TI - An improved multiclass LogitBoost using adaptive-one-vs-one JO - Mach Learn VL - 97 ID - Sun2014 ER - TY - STD TI - Li P. Abc-logitboost for multi-class classification. arXiv preprint: arXiv:0908.4144. 2009. ID - ref47 ER - TY - STD TI - Sun P, Reid MD, Zhou J. Aoso-logitboost: Adaptive one-vs-one logitboost for multi-class problem. arXiv preprint: arXiv:1110.3907. 2011. ID - ref48 ER - TY - JOUR AU - Friedman, J. H. PY - 2001 DA - 2001// TI - Greedy function approximation: a gradient boosting machine JO - Ann Stat VL - 1 ID - Friedman2001 ER - TY - STD TI - Prokhorenkova L, Gusev G, Vorobev A, Dorogush AV, Gulin A. CatBoost: unbiased boosting with categorical features. In: Advances in neural information processing systems. 2018. p. 6638–48. ID - ref50 ER - TY - STD TI - Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; 2016. p. 785–94. ID - ref51 ER - TY - STD TI - Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu TY. Lightgbm: A highly efficient gradient boosting decision tree. In: Advances in neural information processing systems; 2017. p. 3146–54. ID - ref52 ER - TY - STD TI - Chawla NV, Lazarevic A, Hall LO, Bowyer KW. SMOTEBoost: Improving prediction of the minority class in boosting. In: European conference on principles of data mining and knowledge discovery. Springer: Berlin; 2003. p. 107–19 ID - ref53 ER - TY - JOUR AU - Seiffert, C. AU - Khoshgoftaar, T. M. AU - Hulse, J. AU - Napolitano, A. PY - 2009 DA - 2009// TI - RUSBoost: A hybrid approach to alleviating class imbalance JO - IEEE Trans Syst Man Cybern Syst Hum VL - 40 ID - Seiffert2009 ER - TY - STD TI - Rayhan F, Ahmed S, Mahbub A, Jani MR, Shatabda S, Farid DM, Rahman CM. MEBoost: mixing estimators with boosting for imbalanced data classification. In: 2017 11th international conference on software, knowledge, information management and applications (SKIMA); 2017. p. 1–6. ID - ref55 ER - TY - JOUR AU - Sun, Y. AU - Kamel, M. S. AU - Wong, A. K. AU - Wang, Y. PY - 2007 DA - 2007// TI - Cost-sensitive boosting for classification of imbalanced data JO - Pattern Recogn VL - 40 ID - Sun2007 ER - TY - JOUR AU - Fan, W. AU - Stolfo, S. J. AU - Zhang, J. AU - Chan, P. K. PY - 1999 DA - 1999// TI - AdaCost: misclassification cost-sensitive boosting JO - Icml VL - 99 ID - Fan1999 ER - TY - STD TI - Ting KM. A comparative study of cost-sensitive boosting algorithms. In: Proceedings of the 17th international conference on machine learning. 2000. ID - ref58 ER - TY - STD TI - Domingo C, Watanabe O. MadaBoost: A modification of AdaBoost. In: COLT; 2000. p. 180–9. ID - ref59 ER - TY - STD TI - Joshi MV, Agarwal RC, Kumar V. Predicting rare classes: can boosting make any weak learner strong? In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining; 2002. p. 297–306. ID - ref60 ER - TY - STD TI - Joshi MV, Kumar V, Agarwal RC. Evaluating boosting algorithms to classify rare classes: comparison and improvements. In: Proceedings 2001 IEEE international conference on data mining; 2001. p. 257–64. ID - ref61 ER - TY - JOUR AU - Vezhnevets, A. AU - Vezhnevets, V. PY - 2005 DA - 2005// TI - Modest AdaBoost-teaching AdaBoost to generalize better JO - Graphicon VL - 12 ID - Vezhnevets2005 ER - TY - JOUR AU - Mease, D. AU - Wyner, A. AU - Buja, A. PY - 2007 DA - 2007// TI - Cost-weighted boosting with jittering and over/under-sampling: Jous-boost JO - J Mach Learn Res VL - 8 ID - Mease2007 ER - TY - STD TI - Jin X, Hou X, Liu CL. Multi-class AdaBoost with hypothesis margin. In: 2010 20th international conference on pattern recognition. 2010. p. 65–8. ID - ref64 ER - TY - JOUR AU - Chen, S. AU - He, H. AU - Garcia, E. A. PY - 2010 DA - 2010// TI - RAMOBoost: ranked minority oversampling in boosting JO - IEEE Trans Neural Netw VL - 21 ID - Chen2010 ER - TY - STD TI - Saberian MJ, Vasconcelos N. Multiclass boosting: theory and algorithms. In: Advances in neural information processing systems; 2011. p. 2124–32. ID - ref66 ER - TY - JOUR AU - Galar, M. AU - Fernández, A. AU - Barrenechea, E. AU - Herrera, F. PY - 2013 DA - 2013// TI - EUSBoost: enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling JO - Pattern Recogn VL - 46 ID - Galar2013 ER - TY - JOUR AU - Díez-Pastor, J. F. AU - Rodríguez, J. J. AU - García-Osorio, C. AU - Kuncheva, L. I. PY - 2015 DA - 2015// TI - Random balance: ensembles of variable priors classifiers for imbalanced data JO - Knowl-Based Syst VL - 1 ID - Díez-Pastor2015 ER - TY - STD TI - Ahmed S, Rayhan F, Mahbub A, Jani MR, Shatabda S, Farid DM. LIUBoost: locality informed under-boosting for imbalanced data classification. In: Emerging technologies in data mining and information security. Singapore: Springer; 2019. p. 133–44. ID - ref69 ER - TY - JOUR AU - Kumar, S. AU - Biswas, S. K. AU - Devi, D. PY - 2019 DA - 2019// TI - TLUSBoost algorithm: a boosting solution for class imbalance problem JO - Soft Comput VL - 23 ID - Kumar2019 ER - TY - JOUR AU - Deng, X. AU - Liu, Q. AU - Deng, Y. AU - Mahadevan, S. PY - 2016 DA - 2016// TI - An improved method to construct basic probability assignment based on the confusion matrix for classification problem JO - Inf Sci VL - 1 ID - Deng2016 ER - TY - JOUR AU - Chicco, D. AU - Jurman, G. PY - 2020 DA - 2020// TI - The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation JO - BMC Genomics VL - 21 ID - Chicco2020 ER - TY - STD TI - Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10:3. ID - ref73 ER - TY - STD TI - Halimu C, Kasem A, Newaz SS. Empirical Comparison of Area under ROC curve (AUC) and Mathew Correlation Coefficient (MCC) for evaluating machine learning algorithms on imbalanced datasets for binary classification. In: Proceedings of the 3rd international conference on machine learning and soft computing; 2019. p. 1–6. ID - ref74 ER - TY - JOUR AU - Rahman, M. S. AU - Rahman, M. K. AU - Kaykobad, M. AU - Rahman, M. S. PY - 2018 DA - 2018// TI - isGPT: An optimized model to identify sub-Golgi protein types using SVM and Random Forest based feature selection JO - Artif Intell Med VL - 1 ID - Rahman2018 ER - TY - JOUR AU - Jurman, G. AU - Riccadonna, S. AU - Furlanello, C. PY - 2012 DA - 2012// TI - A comparison of MCC and CEN error measures in multi-class prediction JO - PLoS ONE. VL - 7 ID - Jurman2012 ER - TY - JOUR AU - Zhang, Z. L. AU - Luo, X. G. AU - García, S. AU - Tang, J. F. AU - Herrera, F. PY - 2017 DA - 2017// TI - Exploring the effectiveness of dynamic ensemble selection in the one-versus-one scheme JO - Knowl-Based Syst VL - 1 ID - Zhang2017 ER - TY - JOUR AU - Singh, P. K. AU - Sarkar, R. AU - Nasipuri, M. PY - 2016 DA - 2016// TI - Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets JO - Int J Comput Sci Math VL - 7 ID - Singh2016 ER - TY - JOUR AU - Demšar, J. PY - 2006 DA - 2006// TI - Statistical comparisons of classifiers over multiple data sets JO - J Mach Learn Res VL - 7 ID - Demšar2006 ER - TY - JOUR AU - Wilcoxon, F. AU - Katti, S. K. AU - Wilcox, R. A. PY - 1970 DA - 1970// TI - Critical values and probability levels for the Wilcoxon rank sum test and the Wilcoxon signed rank test JO - Selected Tables Math Stat VL - 1 ID - Wilcoxon1970 ER -