Hussain, Saddam and Mustafa, Mohd. Wazir and A. Jumani, Touqeer and Baloch, Shadi Khan and Alotaibi, Hammad and Khan, Ilyas and Khan, Afrasyab (2021) A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection. Energy Reports, 7 . pp. 4425-4436. ISSN 2352-4847
|
PDF
739kB |
Official URL: http://dx.doi.org/10.1016/j.egyr.2021.07.008
Abstract
This paper presents a novel supervised machine learning-based electric theft detection approach using the feature engineered-CatBoost algorithm in conjunction with the SMOTETomek algorithm. Contrary to the previous literature, where the missing observations in data are either ignored or imputed with average values, this work utilizes k-Nearest neighbor technique for missing data imputation; thus, an accurate and realistic estimation of the missing data is achieved. To mitigate the biasness to the majority data class, the proposed model utilizes the SMOTETomek algorithm, which neutralizes the mentioned effect by managing a proper balance between over-sampling and under-sampling techniques.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | CatBoost algorithm, feature engineering, machine learning model interpretation |
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
Divisions: | Electrical Engineering |
ID Code: | 95358 |
Deposited By: | Yanti Mohd Shah |
Deposited On: | 29 Apr 2022 22:33 |
Last Modified: | 29 Apr 2022 22:33 |
Repository Staff Only: item control page