Universiti Teknologi Malaysia Institutional Repository

Pornography web pages classification with textual content analysis using entropy term weighting scheme for small class dataset

Sam, Lee Zhi and Maarof, Mohd. Aizaini and Selamat, Ali and Shamsuddin, Siti Mariyam (2007) Pornography web pages classification with textual content analysis using entropy term weighting scheme for small class dataset. In: Postgraduate Annual Research Seminar (PARS’ 07). , 2007, UTM.

Full text not available from this repository.

Abstract

The fast growth of internet make objectionable web content such as pornography and violence easily explore to web users especially children and teenagers. Due to some popular web filtering techniques like Uniform Resource Locator blocking and Platform for Internet Content Selection checking are limited against today dynamic web content, hence content based analysis techniques with effective model are highly desired. This paper we propose textual content analysis model using entropy term weighting scheme to classify pornography and sex education web pages. We examine the entropy scheme with two other common term weighting schemes which are TFIDF and Glasgow. Those techniques are examined extensively with artificial neural network using small class dataset. We found that our proposed model archive better performance from the aspects of accuracy, convergence speed and stability.

Item Type:Conference or Workshop Item (Paper)
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System
ID Code:14359
Deposited By: Liza Porijo
Deposited On:24 Aug 2011 07:23
Last Modified:18 Sep 2017 07:44

Repository Staff Only: item control page