Classification of illicit web pages using neural network

The illicit web contents such as pornography, violence, gambling, etc, have greatly polluted the mind of web users especially children and teenagers. Due to some popular web filtering techniques like Uniform Resource Locator (URL) blocking and Platform for Internet Content Selection (PICS) checking...

Full description

Bibliographic Details
Main Authors: Lee, Zhi Sam, Aizani, Mohd., Selamat, Ali, Shamsuddin, Siti Mariyam
Format: Article
Language:English
Published: Penerbit UTM Press 2007
Subjects:
Online Access:http://eprints.utm.my/8178/
http://eprints.utm.my/8178/1/MohdAizainiMaarof2007_CLassificationofIllicitWebPagesUsingNeural.pdf
Description
Summary:The illicit web contents such as pornography, violence, gambling, etc, have greatly polluted the mind of web users especially children and teenagers. Due to some popular web filtering techniques like Uniform Resource Locator (URL) blocking and Platform for Internet Content Selection (PICS) checking are limited against today dynamic web content, hence content based analysis techniques with effective model are highly desired In this paper we propose textual content analysis model using entropy term weighting scheme to classify pornography and sex education web pages. We examine the entropy scheme with two other common term weighting schemes which are TFIDF and Glasgow. Those techniques are examined extensively with artificial neural network using small class dataset. In this study, we found that our proposed model archive better performance from the aspects of accuracy, convergence speed and stability.