Universiti Teknologi Malaysia Institutional Repository

Comparative study on corpus development for Malay investment fraud detection in website

Din, M. M. and Hashim, N. H. H. and Siraj, M. M. (2017) Comparative study on corpus development for Malay investment fraud detection in website. Journal of Fundamental and Applied Sciences, 9 (6, SI). pp. 828-838. ISSN 1112-9867

Full text not available from this repository.

Official URL: http://dx.doi.org/10.4314/jfas.v9i6s.62

Abstract

In the online world, fraudster scan easily manipulate people to gain something and usually for monetary gain. Corpus development research can be use identify keywords used by fraudsters online to prevent the crime. The aim of this research is to develop a corpus for Malay investment fraud so that it can be used in detection and classification of investment fraud in Malay website and compare the most suitable technique. In this research, Part-of-Speech tagger (POS) and Named Entity Recognition (NER) tagger are selected. Proposed methodology that are used in this research is corpus development, training and development of dataset using Naive Bayes and performance evaluation. The dataset used in this research is online news archive and discussion forums. This research able to help the law enforcements agencies in collecting and notifying the keyword used by fraudsters so that they can take any legal actions.

Item Type:Article
Uncontrolled Keywords:corpus development, information extraction, part-of-speech, named entity recognition
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:77491
Deposited By: Yanti Mohd Shah
Deposited On:31 Dec 2021 08:45
Last Modified:31 Dec 2021 08:45

Repository Staff Only: item control page