Scalable Text Filtering System

The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this conc...

Full description

Bibliographic Details
Main Authors: Foong, Oi Mean, Ahmad Izuddin Zainal Abidin, A., Yong, S.P.
Format: Conference or Workshop Item
Language:English
Published: 2006
Subjects:
Online Access:http://scholars.utp.edu.my/id/eprint/2569/
http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf
_version_ 1848659266299232256
author Foong, Oi Mean
Ahmad Izuddin Zainal Abidin, A.
Yong, S.P.
author_facet Foong, Oi Mean
Ahmad Izuddin Zainal Abidin, A.
Yong, S.P.
author_sort Foong, Oi Mean
building UTP Institutional Repository
collection Online Access
description The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this concern is filtering software. This paper presents a prototype called DocFilter that filters harmful content of text document without human intervention. The prototype is designed to extract each word of the document, stem the words into its root and compare each word to the list of harmful words in the hash set. Two systems evaluation were conducted to ascertain the performance of DocFilter system. Using various blocking levels, the prototype yields average filtering scores of 73.4%. The system is regarded to have produced an effective filtering accuracy of offensive words for most English text document.
first_indexed 2025-11-13T07:27:42Z
format Conference or Workshop Item
id oai:scholars.utp.edu.my:2569
institution Universiti Teknologi Petronas
institution_category Local University
language English
last_indexed 2025-11-13T07:27:42Z
publishDate 2006
recordtype eprints
repository_type Digital Repository
spelling oai:scholars.utp.edu.my:25692017-01-19T08:27:23Z http://scholars.utp.edu.my/id/eprint/2569/ Scalable Text Filtering System Foong, Oi Mean Ahmad Izuddin Zainal Abidin, A. Yong, S.P. QA75 Electronic computers. Computer science The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this concern is filtering software. This paper presents a prototype called DocFilter that filters harmful content of text document without human intervention. The prototype is designed to extract each word of the document, stem the words into its root and compare each word to the list of harmful words in the hash set. Two systems evaluation were conducted to ascertain the performance of DocFilter system. Using various blocking levels, the prototype yields average filtering scores of 73.4%. The system is regarded to have produced an effective filtering accuracy of offensive words for most English text document. 2006 Conference or Workshop Item PeerReviewed application/pdf en http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf Foong, Oi Mean and Ahmad Izuddin Zainal Abidin, A. and Yong, S.P. (2006) Scalable Text Filtering System. In: M2USIC Conference, 16-17 November 2006, Kuala Lumpur, Malaysia.
spellingShingle QA75 Electronic computers. Computer science
Foong, Oi Mean
Ahmad Izuddin Zainal Abidin, A.
Yong, S.P.
Scalable Text Filtering System
title Scalable Text Filtering System
title_full Scalable Text Filtering System
title_fullStr Scalable Text Filtering System
title_full_unstemmed Scalable Text Filtering System
title_short Scalable Text Filtering System
title_sort scalable text filtering system
topic QA75 Electronic computers. Computer science
url http://scholars.utp.edu.my/id/eprint/2569/
http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf