Original Research

GP-MOMS BASED CLASSIFICATION OF MULTI-CLASS IMBALANCED DATA

RADHIKA KOTECHA

Vol 17, No 11 ( 2022 )   |  DOI: 10.5281/zenodo.7388773   |   Author Affiliation: Department of Information Technology, K. J. Somaiya Institute of Engineering and Information Technology, University of Mumbai, India.   |   Licensing: CC 4.0   |   Pg no: 1981-1995   |   To cite: RADHIKA KOTECHA. (2022). GP-MOMS BASED CLASSIFICATION OF MULTI-CLASS IMBALANCED DATA. 17(11), 1981–1995. https://doi.org/10.5281/zenodo.7388773   |   Published on: 30-11-2022

Abstract

Predicting rarely occurring patterns is challenging but crucial for real-world applications such as healthcare and fraud detection. On datasets with an imbalanced class distribution, however, traditional machine learning techniques focus mainly on frequently occurring patterns and perform poorly on instances of the underrepresented (minority) classes. Moreover, most research in this field considers only binary classification, whereas many applications of interest involve multiple classes, and learning from multi-class imbalanced data is considerably more complex than learning from bi-class imbalanced data. The proposed work therefore addresses multi-class imbalanced data classification through a generic framework intended to suit all application areas. First, the work extends bi-class evaluation measures to multi-class datasets for unbiased performance analysis. Second, it proposes GP-MOMS, an approach based on sampling and Genetic Programming, for efficient classification of multi-class imbalanced data, especially rare patterns. A performance comparison with related benchmark techniques on standard datasets, presented in this work, demonstrates the efficacy of the proposed approach.
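
The abstract does not state which evaluation measures the paper extends, so the following is only an illustrative sketch and not the paper's method. It assumes two commonly used generalisations: macro-averaged recall and the geometric mean (G-mean) of per-class recalls, each computed one-vs-rest. The function names and the toy labels are hypothetical.

```python
from collections import Counter
from math import prod


def per_class_recall(y_true, y_pred):
    """Recall of each class, treating that class as positive (one-vs-rest)."""
    support = Counter(y_true)  # number of true instances per class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: hits[c] / support[c] for c in support}


def macro_recall(y_true, y_pred):
    """Unweighted mean of per-class recalls; every class counts equally."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


def g_mean(y_true, y_pred):
    """Geometric mean of per-class recalls; zero if any class is never found."""
    recalls = per_class_recall(y_true, y_pred)
    return prod(recalls.values()) ** (1 / len(recalls))


# Hypothetical toy data: one majority class and two rare classes.
y_true = ["a"] * 8 + ["b"] + ["c"]
# A degenerate classifier that always predicts the majority class.
y_pred = ["a"] * 10

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(accuracy, macro_recall(y_true, y_pred), g_mean(y_true, y_pred))
```

On this toy data, plain accuracy rewards the majority-only classifier, while both class-balanced measures penalise it for missing every minority instance. This is the kind of bias that an evaluation measure for imbalanced data is meant to avoid.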


Keywords

Classification, Minority Classes, Imbalanced Datasets, Multi-Class, Genetic Programming, Sampling.