Universiti Teknologi Malaysia Institutional Repository

Enhancing generic pipeline model for code clone detection using divide and conquer approach

Mubarak-Ali, Al-Fahim and Syed-Mohamad, Sharifah and Sulaiman, Shahida (2015) Enhancing generic pipeline model for code clone detection using divide and conquer approach. International Arab Journal of Information Technology, 12 (5). pp. 510-517. ISSN 1683-3198

Full text not available from this repository.

Abstract

Code clones are identical copies of code instances or fragments within a software system's source code. Current code clone research focuses on detecting and analysing clones to help software developers identify them and reuse the source code, thereby reducing maintenance cost. Many approaches, such as text-based, token-based and tree-based comparison, have been used to detect code clones. As software grows and becomes a legacy system, the complexity of applying these approaches increases, which makes clones more difficult to detect. The generic pipeline model is the most recent code clone detection model and comprises five processes: parsing, pre-processing, pooling, comparing and filtering. This research enhances the generic pipeline model with a divide and conquer approach that involves a concatenation process. The aim of the approach is to produce better input for the generic pipeline model by processing smaller parts of the source code files before handling the large body of source code in a single pipeline. We implement and apply the proposed approach with the support of a tool called Java Code Clone Detector (JCCD). The results show an improvement in the code clone detection rate and in overall runtime performance compared with the existing generic pipeline model.
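The five processes and the divide and conquer step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation and it does not use JCCD's API. Every function name, the line-window clone definition, the `//` comment rule, the cross-file filter and the `batch_size` parameter are assumptions made for illustration only. The sketch reads "divide and conquer with concatenation" as: parse and pre-process the files in small batches, concatenate the cleaned records, then run pooling, comparing and filtering once over the concatenated input.

```python
from collections import defaultdict

WINDOW = 3  # hypothetical minimum clone size, in normalized lines


def parse(name, text):
    """Parsing: turn one source file into (file, line_no, raw_line) records."""
    return [(name, i, line) for i, line in enumerate(text.splitlines(), start=1)]


def preprocess(records):
    """Pre-processing: collapse whitespace, drop blank lines and // comments."""
    cleaned = []
    for name, line_no, raw in records:
        line = " ".join(raw.split())
        if line and not line.startswith("//"):
            cleaned.append((name, line_no, line))
    return cleaned


def pool(records, window=WINDOW):
    """Pooling: bucket every window of consecutive lines by its content."""
    by_file = defaultdict(list)
    for name, line_no, line in records:
        by_file[name].append((line_no, line))
    pools = defaultdict(list)
    for name, lines in by_file.items():
        for i in range(len(lines) - window + 1):
            chunk = lines[i:i + window]
            key = "\n".join(text for _, text in chunk)
            pools[key].append((name, chunk[0][0]))  # (file, first line number)
    return pools


def compare(pools):
    """Comparing: a pool with more than one member is a clone candidate."""
    return [sorted(members) for members in pools.values() if len(members) > 1]


def filter_clones(groups):
    """Filtering (assumed rule): keep groups that span at least two files."""
    return sorted(g for g in groups if len({name for name, _ in g}) > 1)


def detect(files, batch_size=2):
    """Divide: clean small batches of files. Conquer: one pipeline run."""
    names = sorted(files)
    concatenated = []
    for start in range(0, len(names), batch_size):
        for name in names[start:start + batch_size]:
            concatenated.extend(preprocess(parse(name, files[name])))
    return filter_clones(compare(pool(concatenated)))
```

For example, given a hypothetical `A.java` containing three statements and a `B.java` containing the same three statements plus a comment and a blank line, `detect` reports one clone group listing the starting line of the fragment in each file. In this sketch the batch size only controls how the early stages are chunked, so it does not change the reported clones.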

Item Type: Article
Uncontrolled Keywords: divide and conquer approach, generic pipeline model
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computing
ID Code: 55031
Deposited By: Fazli Masari
Deposited On: 09 Aug 2016 03:53
Last Modified: 15 Feb 2017 07:28
