Context-dependent multilingual lexical lookup for under-resourced languages

Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual...

Full description

Bibliographic Details
Main Authors: Lian, Tze Lim, Enya, Kong Tang, Lay-Ki, Soon, Tek, Yong Lim, Ranaivo-Malançon, Bali
Format: Proceeding
Language:English
Published: 2013
Subjects:
Online Access:http://ir.unimas.my/id/eprint/16527/
http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf
_version_ 1848838081232240640
author Lian, Tze Lim
Enya, Kong Tang
Lay-Ki, Soon
Tek, Yong Lim
Ranaivo-Malançon, Bali
author_facet Lian, Tze Lim
Enya, Kong Tang
Lay-Ki, Soon
Tek, Yong Lim
Ranaivo-Malançon, Bali
author_sort Lian, Tze Lim
building UNIMAS Institutional Repository
collection Online Access
description Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text.
first_indexed 2025-11-15T06:49:53Z
format Proceeding
id unimas-16527
institution Universiti Malaysia Sarawak
institution_category Local University
language English
last_indexed 2025-11-15T06:49:53Z
publishDate 2013
recordtype eprints
repository_type Digital Repository
spelling unimas-165272017-06-07T01:02:33Z http://ir.unimas.my/id/eprint/16527/ Context-dependent multilingual lexical lookup for under-resourced languages Lian, Tze Lim Enya, Kong Tang Lay-Ki, Soon Tek, Yong Lim Ranaivo-Malançon, Bali T Technology (General) Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text. 2013 Proceeding PeerReviewed text en http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf Lian, Tze Lim and Enya, Kong Tang and Lay-Ki, Soon and Tek, Yong Lim and Ranaivo-Malançon, Bali (2013) Context-dependent multilingual lexical lookup for under-resourced languages. In: 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, 4 August 2013 through 9 August 2013, Sofia; Bulgaria.
spellingShingle T Technology (General)
Lian, Tze Lim
Enya, Kong Tang
Lay-Ki, Soon
Tek, Yong Lim
Ranaivo-Malançon, Bali
Context-dependent multilingual lexical lookup for under-resourced languages
title Context-dependent multilingual lexical lookup for under-resourced languages
title_full Context-dependent multilingual lexical lookup for under-resourced languages
title_fullStr Context-dependent multilingual lexical lookup for under-resourced languages
title_full_unstemmed Context-dependent multilingual lexical lookup for under-resourced languages
title_short Context-dependent multilingual lexical lookup for under-resourced languages
title_sort context-dependent multilingual lexical lookup for under-resourced languages
topic T Technology (General)
url http://ir.unimas.my/id/eprint/16527/
http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf