Universiti Teknologi Malaysia Institutional Repository

Ensemble synthesized minority oversampling-based generative adversarial networks and random forest algorithm for credit card fraud detection.

Ghaleb, Fuad A. and Saeed, Faisal and Al-Sarem, Mohammed and Qasem, Sultan Noman and Al-Hadhrami, Tawfik (2023) Ensemble synthesized minority oversampling-based generative adversarial networks and random forest algorithm for credit card fraud detection. IEEE Access, 11 . pp. 89694-89710. ISSN 2169-3536

[img] PDF
1MB

Official URL: http://dx.doi.org/10.1109/ACCESS.2023.3306621

Abstract

The recent increase in credit card fraud is rapidly has caused huge monetary losses for individuals and financial institutions. Most credit card frauds are conducted online by illegally obtaining payment credentials through data breaches, phishing, or scamming. Many solutions have been suggested to address the credit card fraud problem for online transactions. However, the high-class imbalance is the major challenge that faces the existing solutions to construct an effective detection model. Most of the existing techniques used for class imbalance overestimate the distribution of the minority class, resulting in highly overlapped or noisy and unrepresentative features, which cause either overfitting or imprecise learning. In this study, a credit card fraud detection model (CCFDM) is proposed based on ensemble learning and a generative adversarial network (GAN) assisted by Ensemble Synthesized Minority Oversampling techniques (ESMOTE-GAN). Multiple subsets were extracted using under-sampling and SMOTE was applied to generate less skewed sets to prevent the GAN from modeling the noise. These subsets were used to train diverse sets of GAN models to generate the synthesized subsets. A set of Random Forest classifiers was then trained based on the proposed ESMOTE-GAN technique. The probabilistic outputs of the trained classifiers were combined using a weighted voting scheme for decision-making. The results show that the proposed model achieved 1.9%, and 3.2% improvements in overall performance and the detection rate, respectively, with a 0% false alarm rate. Due to the massive number of transactions, even a tiny false positive rate can overwhelm the analysis team. Thus, the proposed model has improved the detection performance and reduced the cost needed for manual analysis.

Item Type:Article
Uncontrolled Keywords:Class imbalance; credit card fraud detection; GAN; Random Forest; SMOTE
Subjects:T Technology > T Technology (General)
T Technology > T Technology (General) > T58.6-58.62 Management information systems
Divisions:Computer Science and Information System
ID Code:104903
Deposited By: Muhamad Idham Sulong
Deposited On:02 Apr 2024 06:30
Last Modified:02 Apr 2024 06:30

Repository Staff Only: item control page