Bilingual dictionary approach for Malay-English cross-language information retrieval

Cross-language information retrieval (CLIR) is the process of submitting queries in one language and retrieving relevant documents written in a different language. A popular approach to CLIR is to translate the query into the language of the documents being retrieved. One of the...

Bibliographic Details
Main Authors: Rais, Nurjannaton Hidayah, Abdullah, Muhamad Taufik, Abdul Kadir, Rabiah
Format: Article
Language: English
Published: David Publishing 2011
Online Access: http://psasir.upm.edu.my/id/eprint/22490/
http://psasir.upm.edu.my/id/eprint/22490/1/Bilingual%20dictionary%20approach%20for%20Malay-English%20cross-language%20information%20retrieval.pdf
building UPM Institutional Repository
collection Online Access
description Cross-language information retrieval (CLIR) is the process of submitting queries in one language and retrieving relevant documents written in a different language. A popular approach to CLIR is to translate the query into the language of the documents being retrieved. One of the simplest and most effective methods for query translation is dictionary lookup based on a bilingual dictionary. Direct translation using a bilingual dictionary is prone to three main problems: (1) knowing how a term expressed in one language might be written in another; (2) deciding which of the possible translations should be retained; and (3) deciding how to properly weight the importance of translation alternatives when more than one is retained. We evaluated the effectiveness of a Malay-English CLIR system using the bilingual dictionary approach and present the evaluation results for dictionary-based CLIR. A document collection containing newspaper articles and a related set of 35 search queries were used in this test. First, monolingual baseline queries were created manually in Malay and English. Second, queries in Malay were automatically translated into English, and vice versa. Two basic translation approaches using a bilingual dictionary were applied to each query: select the first translation listed in the dictionary, and select all translations listed in the dictionary. Then, an alternative weighting scheme was applied to the second approach (select all translations) to enhance retrieval performance. These three experiments were evaluated using Mean Average Precision (MAP) and average recall-precision graphs. The results were compared to monolingual IR for the Malay and English document collections, respectively.
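The translation strategies and the MAP measure named in the description can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy dictionary entries, the function names, the pass-through handling of terms missing from the dictionary, and in particular the uniform 1/n weighting, since the abstract does not say which weighting scheme the authors used.

```python
from collections import defaultdict

# Hypothetical toy Malay-to-English dictionary; entries are illustrative only.
TOY_DICT = {
    "harga": ["price", "cost", "value"],
    "minyak": ["oil", "petrol"],
}


def translate_query(terms, dictionary, strategy="first"):
    """Translate query terms and return a {target_term: weight} mapping.

    strategy:
      "first"        - keep only the first translation listed for each term
      "all"          - keep every listed translation, each with weight 1.0
      "all_weighted" - keep every translation, splitting weight 1.0 evenly
                       among them (an assumed scheme, not the paper's)
    """
    weights = defaultdict(float)
    for term in terms:
        translations = dictionary.get(term)
        if not translations:
            # Assumption: terms missing from the dictionary pass through as-is.
            weights[term] += 1.0
        elif strategy == "first":
            weights[translations[0]] += 1.0
        elif strategy == "all":
            for target in translations:
                weights[target] += 1.0
        elif strategy == "all_weighted":
            share = 1.0 / len(translations)
            for target in translations:
                weights[target] += share
        else:
            raise ValueError(f"unknown strategy: {strategy}")
    return dict(weights)


def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked result list against a relevant set."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant hit
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of average precision over (ranked_ids, relevant_ids) pairs."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)
```

With the toy dictionary, `translate_query(["harga", "minyak"], TOY_DICT, "first")` keeps only `price` and `oil`, while `"all_weighted"` keeps all five translations but gives each alternative of an ambiguous term a smaller share of the weight, so that a term with many translations does not dominate the translated query.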
first_indexed 2025-11-15T08:31:53Z
id upm-22490
institution Universiti Putra Malaysia
institution_category Local University
last_indexed 2025-11-15T08:31:53Z
recordtype eprints
repository_type Digital Repository
spelling Rais, Nurjannaton Hidayah and Abdullah, Muhamad Taufik and Abdul Kadir, Rabiah (2011) Bilingual dictionary approach for Malay-English cross-language information retrieval. Journal of Communication and Computer, 8 (5). pp. 354-360. ISSN 1548-7709; ESSN: 1930-1553. David Publishing, 2011-05. Article, PeerReviewed, application/pdf, en. http://www.davidpublishing.com/show.html?170