Universiti Teknologi Malaysia Institutional Repository

Empirical performance evaluation of imputation techniques using medical dataset

Alade, O. A. and Sallehuddin, R. and Selamat, A. (2019) Empirical performance evaluation of imputation techniques using medical dataset. In: International Conference on Green Engineering Technology and Applied Computing 2019, IConGETech2 019 and International Conference on Applied Computing 2019, ICAC 2019, 4-5 Feb 2019, Eastin Hotel Makkasan, Bangkok, Thailand.

[img]
Preview
PDF
969kB

Official URL: https://dx.doi.org/10.1088/1757-899X/551/1/012055

Abstract

This paper evaluates the error measures of missing value imputations in medical research. Several imputation techniques have been designed and implemented, however, the evaluation of the degree of deviation of the imputed values from the original values have not been given adequate attention. Predictive Mean Matching Imputation (PMMI) and K-Nearest Neighbour Imputation (KNNI) techniques were implemented on imputation of fertility dataset. The implementation was on three mechanisms of missing values: Missing At Random (MAR), Missing Completely At Random (MCAR) and Missing Not At Random (MNAR). The results were evaluated by mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE). PMMI performed better than KNNI in all the results. MSE for example, has the ratio of 0.0260/2.8555 (PMMI/KNNI) for 1-10% MAR - 99.09% reduced error rate; 0.1108/3.0120 (PMMI/KNNI) for 30-40% MCAR - 96.32 reduced error rate; and 0.0642/3.7187 (PMMI/KNNI) for 40-50% MNAR - 98.27% reduced error rate. MCAR was the most consistent missingness mechanism for the evaluations. Density distributions of the imputed dataset were compared with the original dataset. The distribution plots of the imputed missing data followed the curve of the original dataset.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:errors, green computing, nearest neighbor search
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:89908
Deposited By: Narimah Nawil
Deposited On:04 Mar 2021 02:45
Last Modified:04 Mar 2021 02:45

Repository Staff Only: item control page