Universiti Teknologi Malaysia Institutional Repository

Investigating group distributionally robust optimization for deep imbalanced learning: a case study of binary tabular data classification.

Mustapha, Ismail and Hasan, Shafaatunnur and Nabbus, Hatem S. Y. and Montaser, Mohamed Mostafa Ali and Olatunji, Sunday Olusanya and Shamsuddin, Siti Maryam (2023) Investigating group distributionally robust optimization for deep imbalanced learning: a case study of binary tabular data classification. International Journal Of Advanced Computer Science And Applications, 14 (2). pp. 739-748. ISSN 2158-107X

[img] PDF
1MB

Official URL: http://dx.doi.org/10.14569/IJACSA.2023.0140286

Abstract

One of the most studied machine learning challenges that recent studies have shown the susceptibility of deep neural networks to is the class imbalance problem. While concerted research efforts in this direction have been notable in recent years, findings have shown that the canonical learning objective, empirical risk minimization (ERM), is unable to achieve optimal imbalance learning in deep neural networks given its bias to the majority class. An alternative learning objective, group distributionally robust optimization (gDRO), is investigated in this study for imbalance learning, focusing on tabular imbalanced data as against image data that has dominated deep imbalance learning research. Contrary to minimizing average per instance loss as in ERM, gDRO seeks to minimize the worst group loss over the training data. Experimental findings in comparison with ERM and classical imbalance methods using four popularly used evaluation metrics in imbalance learning across several benchmark imbalance binary tabular data of varying imbalance ratios reveal impressive performance of gDRO, outperforming other compared methods in terms of g-mean and roc-auc.

Item Type:Article
Uncontrolled Keywords:Class imbalance; deep neural networks; empirical risk minimization; group distributionally robust optimization; tabular data
Subjects:T Technology > T Technology (General)
T Technology > T Technology (General) > T55-55.3 Industrial Safety. Industrial Accident Prevention
Divisions:Computer Science and Information System
ID Code:105382
Deposited By: Muhamad Idham Sulong
Deposited On:24 Apr 2024 06:45
Last Modified:24 Apr 2024 06:45

Repository Staff Only: item control page