Syllable-based Malay Word Stemmer

Bibliographic Details
Main Authors: Jun, Choi Lee; Mohamad Othman, Rosita; Mohamad, Nurul Zawiyah
Format: Proceeding
Language: English
Published: Universiti Malaysia Sarawak, (UNIMAS) 2014
Subjects:
Online Access: http://ir.unimas.my/id/eprint/1367/
http://ir.unimas.my/id/eprint/1367/1/Syllable-based%2BMalay%2BWord%2BStemmer.pdf
Description
Summary: A word stemmer is one of the basic and crucial text processing tools for any language. It is not only useful in morphological study but also plays an important role in word-level context analysis. The existence of prefixes, suffixes, infixes, and combinations of affixes in Malay words raises the complexity of stemming them. An approach to stemming Malay words using a syllabification algorithm is introduced. This approach performs stemming by comparing the syllables in a word, thus reducing the parsing processes. The approach shows high practicality, as it produces very high accuracy in the evaluation.
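
To make the idea of stemming by comparing syllables concrete, the following is a minimal sketch in Python. It is not the authors' algorithm, which this record does not describe in detail. The syllabification rule (a single consonant between vowels opens the next syllable, and the first of several consonants closes the previous one), the affix lists, and the names `syllabify`, `stem`, `PREFIX_SYLLABLES`, `SUFFIX_SYLLABLES`, and `MIN_STEM_SYLLABLES` are all assumptions made for illustration.

```python
# Illustrative sketch only: rules and affix lists are assumptions,
# not taken from the paper.
VOWELS = set("aeiou")
DIGRAPHS = ("ng", "ny", "sy", "kh", "gh")  # treated as single consonants

# Hypothetical affix lists; each affix is stored as one whole syllable.
PREFIX_SYLLABLES = {"me", "mem", "men", "meng", "ber", "ter", "per", "di", "ke", "se"}
SUFFIX_SYLLABLES = {"kan", "lah", "nya"}
MIN_STEM_SYLLABLES = 2  # never strip an affix if fewer syllables would remain


def to_units(word):
    """Split a word into letters, keeping consonant digraphs together."""
    units, i = [], 0
    while i < len(word):
        if word[i:i + 2] in DIGRAPHS:
            units.append(word[i:i + 2])
            i += 2
        else:
            units.append(word[i])
            i += 1
    return units


def syllabify(word):
    """Split a word into syllables around its vowels (simplified rule)."""
    units = to_units(word.lower())
    nuclei = [i for i, unit in enumerate(units) if unit in VOWELS]
    if not nuclei:
        return [word.lower()]
    starts = [0]
    for prev, nxt in zip(nuclei, nuclei[1:]):
        gap = nxt - prev - 1  # consonant units between two vowels
        # 0 or 1 consonant: it opens the next syllable.
        # 2 or more: the first one closes the previous syllable.
        starts.append(prev + 1 if gap <= 1 else prev + 2)
    ends = starts[1:] + [len(units)]
    return ["".join(units[s:e]) for s, e in zip(starts, ends)]


def stem(word):
    """Drop a leading prefix syllable and a trailing suffix syllable."""
    syllables = syllabify(word)
    if len(syllables) > MIN_STEM_SYLLABLES and syllables[0] in PREFIX_SYLLABLES:
        syllables = syllables[1:]
    if len(syllables) > MIN_STEM_SYLLABLES and syllables[-1] in SUFFIX_SYLLABLES:
        syllables = syllables[:-1]
    return "".join(syllables)


print(syllabify("mendapatkan"))  # ['men', 'da', 'pat', 'kan']
print(stem("mendapatkan"))       # dapat
print(stem("dimakan"))           # makan
```

Because affixes are matched against whole syllables, each word needs only one syllabification pass followed by a set lookup at either end, rather than repeated character-level pattern matching. The sketch has clear limits: it does not handle suffixes that merge with the final consonant of the root (for example -an in "makanan"), infixes, or sound changes at the prefix boundary, and without a root-word dictionary it can overstem words whose first syllable merely resembles a prefix.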