Universiti Teknologi Malaysia Institutional Repository

MINN: a missing data imputation technique for analogy-based effort estimation

Shah, Muhammad Arif and Jawawi, Dayang N. A. and Isa, Mohd. Adham and Wakil, Karzan and Younas, Muhammad and Mustafa, Ahmed (2019) MINN: a missing data imputation technique for analogy-based effort estimation. International Journal of Advanced Computer Science and Applications, 10 (2). pp. 222-232. ISSN 2158-107X

[img]
Preview
PDF
1MB

Official URL: http://dx.doi.org/10.14569/ijacsa.2019.0100230

Abstract

Success and failure of a complex software project are strongly associated with the accurate estimation of development effort. There are numerous estimation models developed but the most widely used among those is Analogy- Based Estimation (ABE). ABE model follows human nature as it estimates the future project's effort by making analogies with the past project's data. Since ABE relies on the historical datasets, the quality of the datasets affects the accuracy of estimation. Most of the software engineering datasets have missing values. The researchers either delete the projects containing missing values or avoid treating the missing values which reduce the ABE performance. In this study, Numeric Cleansing (NC), K-Nearest Neighbor Imputation (KNNI) and Median Imputation of the Nearest Neighbor (MINN) methods are used to impute the missing values in Desharnais and DesMiss datasets for ABE. MINN technique is introduced in this study. A comparison among these imputation methods is performed to identify the suitable missing data imputation method for ABE. The results suggested that MINN imputes more realistic values in the missing datasets as compared to values imputed through NC and KNNI. It was also found that the imputation treatment method helped in better prediction of the software development effort on ABE model.

Item Type:Article
Uncontrolled Keywords:effort estimation, missing data imputation, software development
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:88963
Deposited By: Yanti Mohd Shah
Deposited On:26 Jan 2021 08:36
Last Modified:26 Jan 2021 08:36

Repository Staff Only: item control page