Universiti Teknologi Malaysia Institutional Repository

A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection

Hussain, Saddam and Mustafa, Mohd. Wazir and A. Jumani, Touqeer and Baloch, Shadi Khan and Alotaibi, Hammad and Khan, Ilyas and Khan, Afrasyab (2021) A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection. Energy Reports, 7 . pp. 4425-4436. ISSN 2352-4847

[img]
Preview
PDF
739kB

Official URL: http://dx.doi.org/10.1016/j.egyr.2021.07.008

Abstract

This paper presents a novel supervised machine learning-based electric theft detection approach using the feature engineered-CatBoost algorithm in conjunction with the SMOTETomek algorithm. Contrary to the previous literature, where the missing observations in data are either ignored or imputed with average values, this work utilizes k-Nearest neighbor technique for missing data imputation; thus, an accurate and realistic estimation of the missing data is achieved. To mitigate the biasness to the majority data class, the proposed model utilizes the SMOTETomek algorithm, which neutralizes the mentioned effect by managing a proper balance between over-sampling and under-sampling techniques.

Item Type:Article
Uncontrolled Keywords:CatBoost algorithm, feature engineering, machine learning model interpretation
Subjects:T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions:Electrical Engineering
ID Code:95358
Deposited By: Yanti Mohd Shah
Deposited On:29 Apr 2022 22:33
Last Modified:29 Apr 2022 22:33

Repository Staff Only: item control page