Bilingual dictionary approach for Malay-English cross-language information retrieval
Cross-language information retrieval (CLIR) is the process of submitting a query in one language and retrieving relevant documents that are written in a different language. A popular approach to CLIR is to translate the query into the language of the documents being retrieved. One of the...
| Main Authors: | Rais, Nurjannaton Hidayah; Abdullah, Muhamad Taufik; Abdul Kadir, Rabiah |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | David Publishing 2011 |
| Online Access: | http://psasir.upm.edu.my/id/eprint/22490/ http://psasir.upm.edu.my/id/eprint/22490/1/Bilingual%20dictionary%20approach%20for%20Malay-English%20cross-language%20information%20retrieval.pdf |
| _version_ | 1848844498943082496 |
|---|---|
| author | Rais, Nurjannaton Hidayah; Abdullah, Muhamad Taufik; Abdul Kadir, Rabiah |
| author_facet | Rais, Nurjannaton Hidayah; Abdullah, Muhamad Taufik; Abdul Kadir, Rabiah |
| author_sort | Rais, Nurjannaton Hidayah |
| building | UPM Institutional Repository |
| collection | Online Access |
| description | Cross-language information retrieval (CLIR) is the process of submitting a query in one language and retrieving relevant documents that are written in a different language. A popular approach to CLIR is to translate the query into the language of the documents being retrieved. One of the simplest and most effective methods for query translation is dictionary lookup based on a bilingual dictionary. Direct translation using a bilingual dictionary is prone to three main problems: (1) knowing how a term expressed in one language might be written in another; (2) deciding which of the possible translations should be retained; and (3) deciding how to properly weight the importance of translation alternatives when more than one is retained. In this study, we evaluated the effectiveness of a Malay-English CLIR system that uses the bilingual dictionary approach and present the evaluation results. A document collection containing newspaper articles and a related set of 35 search queries were used in the test. First, monolingual baseline queries were created manually in Malay and in English. Second, the Malay queries were automatically translated into English, and vice versa. There are two basic translation approaches using a bilingual dictionary: for each query term, select the first translation listed in the dictionary, or select all translations listed in the dictionary. An alternative weighting scheme was then applied to the second approach (select all translations) to enhance retrieval performance. These three experiments were evaluated using Mean Average Precision (MAP) and the average recall-precision graph. The results were compared to monolingual IR on the Malay and English document collections, respectively. |
| first_indexed | 2025-11-15T08:31:53Z |
| format | Article |
| id | upm-22490 |
| institution | Universiti Putra Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T08:31:53Z |
| publishDate | 2011 |
| publisher | David Publishing |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | upm-22490 2015-11-27T07:57:45Z http://psasir.upm.edu.my/id/eprint/22490/ Bilingual dictionary approach for Malay-English cross-language information retrieval Rais, Nurjannaton Hidayah Abdullah, Muhamad Taufik Abdul Kadir, Rabiah David Publishing 2011-05 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/22490/1/Bilingual%20dictionary%20approach%20for%20Malay-English%20cross-language%20information%20retrieval.pdf Rais, Nurjannaton Hidayah and Abdullah, Muhamad Taufik and Abdul Kadir, Rabiah (2011) Bilingual dictionary approach for Malay-English cross-language information retrieval. Journal of Communication and Computer, 8 (5). pp. 354-360. ISSN 1548-7709; ESSN: 1930-1553 http://www.davidpublishing.com/show.html?170 |
| spellingShingle | Rais, Nurjannaton Hidayah Abdullah, Muhamad Taufik Abdul Kadir, Rabiah Bilingual dictionary approach for Malay-English cross-language information retrieval |
| title | Bilingual dictionary approach for Malay-English cross-language information retrieval |
| title_sort | bilingual dictionary approach for malay-english cross-language information retrieval |
| url | http://psasir.upm.edu.my/id/eprint/22490/ http://psasir.upm.edu.my/id/eprint/22490/1/Bilingual%20dictionary%20approach%20for%20Malay-English%20cross-language%20information%20retrieval.pdf |
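
The description above names three query translation strategies (first translation only, all translations, and all translations with a weighting scheme) and evaluation by Mean Average Precision. The following is a minimal sketch of how these could look, not the authors' implementation. The dictionary entries, the function names, and in particular the uniform `1/n` weighting are assumptions made for illustration; the record does not say which weighting scheme the paper used.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Toy Malay-to-English dictionary. Entries are hypothetical examples,
# not taken from the paper or from any real lexicon.
TOY_DICT: Dict[str, List[str]] = {
    "harga": ["price", "cost", "value"],
    "minyak": ["oil", "petroleum"],
}


def translate_query(terms: Iterable[str],
                    dictionary: Dict[str, List[str]],
                    strategy: str = "first") -> Dict[str, float]:
    """Map source-language query terms to weighted target-language terms.

    strategy:
      "first"        keep only the first listed translation (weight 1.0)
      "all"          keep every listed translation (weight 1.0 each)
      "all_weighted" keep every translation, each weighted 1/n for a term
                     with n translations (an assumed scheme, see lead-in)
    """
    weights: Dict[str, float] = {}
    for term in terms:
        # Terms missing from the dictionary are passed through untranslated.
        candidates = dictionary.get(term) or [term]
        if strategy == "first":
            chosen, weight = candidates[:1], 1.0
        elif strategy == "all":
            chosen, weight = candidates, 1.0
        elif strategy == "all_weighted":
            chosen, weight = candidates, 1.0 / len(candidates)
        else:
            raise ValueError(f"unknown strategy: {strategy}")
        for target in chosen:
            weights[target] = weights.get(target, 0.0) + weight
    return weights


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average of precision values at the ranks where relevant docs appear."""
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def mean_average_precision(runs: List[Tuple[List[str], Set[str]]]) -> float:
    """Mean of average precision over (ranking, relevant set) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)
```

With this sketch, `translate_query(["harga", "minyak"], TOY_DICT, "all_weighted")` spreads each source term's unit weight across its alternatives, so a term with many translations does not dominate the translated query. That is one common motivation for weighting, stated here as an assumption rather than as the paper's rationale.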