Universiti Teknologi Malaysia Institutional Repository

Fraudulent e-Commerce website detection model using HTML, text and image features

Khoo, Eric and Zainal, Anazida and Ariffin, Nurfadilah and Kassim, Mohd. Nizam and Maarof, Mohd Aizaini and Bakhtiari, Majid (2020) Fraudulent e-Commerce website detection model using HTML, text and image features. In: 11th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2019, and 11th World Congress on Nature and Biologically Inspired Computing, NaBIC 2019, 13 – 15 December 2019, Hyderabad, India.

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1007/978-3-030-49345-5_19

Abstract

Many of Internet users have been the victims of fraudulent e-commerce websites and the number grows. This paper presents an investigation on three types of features namely HTML tags, textual content and image of the website that could possibly contain some patterns that indicate it is fraudulent. Four machine learning algorithms were used to measure the accuracy of the fraudulent e-commerce websites detection. These techniques are Linear Regression, Decision Tree, Random Forest and XGBoost. 497 e-commerce websites were used as training and testing dataset. Testing was done in two phases. In phase one, each features was tested to see its discriminative capability. Meanwhile in phase two, these features were combined. The result shows that textual content has consistently outperformed the other two features especially when XGBoost was used as a classifier. With combined features, overall accuracy has improved and best result of accuracy recorded was 98.7% achieved when Linear Regression was used as a classifier.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Fraudulent website, HTML tags and image
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:94157
Deposited By: Widya Wahid
Deposited On:28 Feb 2022 13:24
Last Modified:28 Feb 2022 13:24

Repository Staff Only: item control page