Universiti Teknologi Malaysia Institutional Repository

Fuzzy C-mean missing data imputation for analogy-based effort estimation

Al Mutlaq, Ayman Jalal and Abang Jawawi, Dayang Norhayati and Arbain, Adila Firdaus (2021) Fuzzy C-mean missing data imputation for analogy-based effort estimation. International Journal of Advanced Computer Science and Applications, 12 (8). pp. 628-640. ISSN 2158-107X


Official URL: http://dx.doi.org/10.14569/IJACSA.2021.0120874

Abstract

The accuracy of effort estimation is one of the major factors in the success or failure of software projects. Analogy-Based Estimation (ABE) is a widely accepted estimation model because it follows human nature in selecting analogies similar to the target project. The prediction accuracy of the ABE model is strongly associated with the quality of the dataset, since it depends on previously completed projects for estimation. Missing Data (MD) is one of the major challenges in software engineering datasets. Several missing data imputation techniques have been investigated by researchers for the ABE model. Identifying the most similar donor values from the completed software projects dataset for imputation is a challenging issue in the existing missing data techniques adopted for the ABE model. In this study, Fuzzy C-Mean Imputation (FCMI), Mean Imputation (MI) and K-Nearest Neighbor Imputation (KNNI) are investigated to impute missing values in the Desharnais dataset under different missing data percentages (Desh-Miss1, Desh-Miss2) for the ABE model. The FCMI-ABE technique is proposed in this study. A comparative evaluation of MI, KNNI, and ABE-FCMI is conducted for the ABE model to identify a suitable MD imputation method. The results suggest that ABE-FCMI, rather than MI and KNNI, imputes more reliable values for incomplete software projects in the missing datasets. It was also found that the proposed imputation method significantly improves the software development effort prediction of the ABE model.
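
For orientation, the general idea behind two of the imputation strategies named in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names, the toy data, the deterministic centre initialisation, and the choice of two clusters with fuzzifier m = 2 are all assumptions made for the example. Mean imputation fills a gap with the column mean; the fuzzy c-means variant fits cluster centres on the complete rows, computes each incomplete row's memberships from its observed features only, and fills each gap with the membership-weighted average of the centre values.

```python
import numpy as np


def mean_impute(X):
    """Replace each missing value (NaN) with the mean of its column."""
    X = np.asarray(X, dtype=float).copy()
    col_means = np.nanmean(X, axis=0)
    rows, cols = np.where(np.isnan(X))
    X[rows, cols] = col_means[cols]
    return X


def _memberships(dist, m):
    """Standard fuzzy c-means membership from distances to each centre."""
    dist = np.maximum(dist, 1e-12)           # avoid division by zero
    inv = 1.0 / dist ** (2.0 / (m - 1.0))
    return inv / inv.sum(axis=-1, keepdims=True)


def fcm_impute(X, n_clusters=2, m=2.0, n_iter=100):
    """Hypothetical fuzzy c-means imputation sketch (not the paper's code).

    Assumes at least n_clusters complete rows; features are used unscaled,
    whereas a real study would normally normalise them first.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    complete = X[~missing.any(axis=1)]

    # Simple deterministic initialisation (an assumption for this sketch):
    # pick complete rows spread evenly along the ordering by row sum.
    order = np.argsort(complete.sum(axis=1))
    picks = np.linspace(0, len(complete) - 1, n_clusters).astype(int)
    centers = complete[order[picks]].copy()

    # Alternate membership and centre updates on the complete rows.
    for _ in range(n_iter):
        dist = np.linalg.norm(complete[:, None, :] - centers[None, :, :], axis=2)
        w = _memberships(dist, m) ** m
        new_centers = (w.T @ complete) / w.sum(axis=0)[:, None]
        if np.allclose(new_centers, centers):
            break
        centers = new_centers

    # Fill each incomplete row from the membership-weighted centre values,
    # with memberships computed on the observed features only.
    out = X.copy()
    for i in np.where(missing.any(axis=1))[0]:
        obs = ~missing[i]
        dist = np.linalg.norm(centers[:, obs] - X[i, obs], axis=1)
        u = _memberships(dist, m)
        out[i, missing[i]] = u @ centers[:, missing[i]]
    return out


# Toy data (invented for illustration): two well-separated groups of
# projects, with the second feature missing in the last two rows.
data = np.array([
    [1.0, 10.0], [1.1, 10.2], [0.9, 9.8],
    [5.0, 50.0], [5.1, 50.5], [4.9, 49.5],
    [1.0, np.nan], [5.0, np.nan],
])
filled_mean = mean_impute(data)
filled_fcm = fcm_impute(data)
print(filled_mean[6:], filled_fcm[6:], sep="\n")
```

On data like this, mean imputation assigns the same value to both incomplete rows, while the cluster-based fill tracks the group each row resembles, which is the intuition behind preferring donor values from similar projects.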

Item Type: Article
Uncontrolled Keywords: fuzzy c-mean, imputation, missing data
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computing
ID Code: 94981
Deposited By: Yanti Mohd Shah
Deposited On: 29 Apr 2022 22:32
Last Modified: 29 Apr 2022 22:32
