Selamat, Ali and Lee, Zhi Sam and Maarof, Mohd. Aizaini and Shamsuddin, Siti Mariyam (2011) Improved web page identification method using neural networks. International Journal of Computational Intelligence and Applications, 10 (1). pp. 87-114. ISSN 1469-0268
Full text not available from this repository.
Official URL: http://dx.doi.org/10.1142/S1469026811003008
Abstract
In this paper, an improved web page classification method (IWPCM) using neural networks to identify the illicit contents of web pages is proposed. The proposed IWPCM approach is based on the improvement of feature selection of the web pages using class based feature vectors (CPBF). The CPBF feature selection approach has been calculated by considering the important term's weight for illicit web documents and reduce the dependency of the less important term's weight for normal web documents. The IWPCM approach has been examined using the modified term-weighting scheme by comparing it with several traditional term-weighting schemes for non-illicit and illicit web contents available from the web. The precision, recall, and F1 measures have been used to evaluate the effectiveness of the proposed IWPCM approach. The experimental results have shown that the proposed improved term-weighting scheme has been able to identify the non-illicit and illicit web contents available from the experimental datasets.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | artificial neural network, illicit web page classification, term-weighting scheme, textual content analysis |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computer Science and Information System |
ID Code: | 29213 |
Deposited By: | Yanti Mohd Shah |
Deposited On: | 25 Feb 2013 07:07 |
Last Modified: | 17 Mar 2019 03:03 |
Repository Staff Only: item control page