Universiti Teknologi Malaysia Institutional Repository

Features based text similarity detection

Kok Kent, Chow and Salim, Naomie (2010) Features based text similarity detection. Journal of Computing, 2 (1). pp. 53-57. ISSN 2151-9617

Full text not available from this repository.

Official URL: http://arxiv.org/pdf/1001.3487v1

Abstract

As the Internet help us cross cultural border by providing different information, plagiarism issue is bound to arise. As a result, plagiarism detection becomes more demanding in overcoming this issue. Different plagiarism detection tools have been developed based on various detection techniques. Nowadays, fingerprint matching technique plays an important role in those detection tools. However, in handling some large content articles, there are some weaknesses in fingerprint matching technique especially in space and time consumption issue. In this paper, we propose a new approach to detect plagiarism which integrates the use of fingerprint matching technique with four key features to assist in the detection process. These proposed features are capable to choose the main point or key sentence in the articles to be compared. Those selected sentence will be undergo the fingerprint matching process in order to detect the similarity between the sentences. Hence, time and space usage for the comparison process is reduced without affecting the effectiveness of the plagiarism detection.

Item Type:Article
Uncontrolled Keywords:fingerprint matching technique, longest common subsequence (LCS), plagiarism
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System
ID Code:25940
Deposited By: Narimah Nawil
Deposited On:18 Jun 2012 03:05
Last Modified:22 Mar 2018 10:53

Repository Staff Only: item control page