Scalable Text Filtering System
The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this conc...
| Main Authors: | , , |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: |
2006
|
| Subjects: | |
| Online Access: | http://scholars.utp.edu.my/id/eprint/2569/ http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf |
| _version_ | 1848659266299232256 |
|---|---|
| author | Foong, Oi Mean Ahmad Izuddin Zainal Abidin, A. Yong, S.P. |
| author_facet | Foong, Oi Mean Ahmad Izuddin Zainal Abidin, A. Yong, S.P. |
| author_sort | Foong, Oi Mean |
| building | UTP Institutional Repository |
| collection | Online Access |
| description | The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this concern is filtering software. This paper presents a prototype called DocFilter that filters harmful content of text document without human intervention. The prototype is designed to extract each word of the document, stem the words into its root and compare each word to the list of harmful words in the hash set. Two systems evaluation were conducted to ascertain the performance of DocFilter system. Using various blocking levels, the prototype yields average filtering scores of 73.4%. The system is regarded to have produced an effective filtering accuracy of offensive words for most English text document. |
| first_indexed | 2025-11-13T07:27:42Z |
| format | Conference or Workshop Item |
| id | oai:scholars.utp.edu.my:2569 |
| institution | Universiti Teknologi Petronas |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-13T07:27:42Z |
| publishDate | 2006 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | oai:scholars.utp.edu.my:25692017-01-19T08:27:23Z http://scholars.utp.edu.my/id/eprint/2569/ Scalable Text Filtering System Foong, Oi Mean Ahmad Izuddin Zainal Abidin, A. Yong, S.P. QA75 Electronic computers. Computer science The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this concern is filtering software. This paper presents a prototype called DocFilter that filters harmful content of text document without human intervention. The prototype is designed to extract each word of the document, stem the words into its root and compare each word to the list of harmful words in the hash set. Two systems evaluation were conducted to ascertain the performance of DocFilter system. Using various blocking levels, the prototype yields average filtering scores of 73.4%. The system is regarded to have produced an effective filtering accuracy of offensive words for most English text document. 2006 Conference or Workshop Item PeerReviewed application/pdf en http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf Foong, Oi Mean and Ahmad Izuddin Zainal Abidin, A. and Yong, S.P. (2006) Scalable Text Filtering System. In: M2USIC Conference, 16-17 November 2006, Kuala Lumpur, Malaysia. |
| spellingShingle | QA75 Electronic computers. Computer science Foong, Oi Mean Ahmad Izuddin Zainal Abidin, A. Yong, S.P. Scalable Text Filtering System |
| title | Scalable Text Filtering System |
| title_full | Scalable Text Filtering System |
| title_fullStr | Scalable Text Filtering System |
| title_full_unstemmed | Scalable Text Filtering System |
| title_short | Scalable Text Filtering System |
| title_sort | scalable text filtering system |
| topic | QA75 Electronic computers. Computer science |
| url | http://scholars.utp.edu.my/id/eprint/2569/ http://scholars.utp.edu.my/id/eprint/2569/1/Scalable_Text_Filtering.pdf |