Context-dependent multilingual lexical lookup for under-resourced languages
Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual...
| Main Authors: | , , , , |
|---|---|
| Format: | Proceeding |
| Language: | English |
| Published: |
2013
|
| Subjects: | |
| Online Access: | http://ir.unimas.my/id/eprint/16527/ http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf |
| _version_ | 1848838081232240640 |
|---|---|
| author | Lian, Tze Lim Enya, Kong Tang Lay-Ki, Soon Tek, Yong Lim Ranaivo-Malançon, Bali |
| author_facet | Lian, Tze Lim Enya, Kong Tang Lay-Ki, Soon Tek, Yong Lim Ranaivo-Malançon, Bali |
| author_sort | Lian, Tze Lim |
| building | UNIMAS Institutional Repository |
| collection | Online Access |
| description | Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text. |
| first_indexed | 2025-11-15T06:49:53Z |
| format | Proceeding |
| id | unimas-16527 |
| institution | Universiti Malaysia Sarawak |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T06:49:53Z |
| publishDate | 2013 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | unimas-165272017-06-07T01:02:33Z http://ir.unimas.my/id/eprint/16527/ Context-dependent multilingual lexical lookup for under-resourced languages Lian, Tze Lim Enya, Kong Tang Lay-Ki, Soon Tek, Yong Lim Ranaivo-Malançon, Bali T Technology (General) Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text. 2013 Proceeding PeerReviewed text en http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf Lian, Tze Lim and Enya, Kong Tang and Lay-Ki, Soon and Tek, Yong Lim and Ranaivo-Malançon, Bali (2013) Context-dependent multilingual lexical lookup for under-resourced languages. In: 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, 4 August 2013 through 9 August 2013, Sofia; Bulgaria. |
| spellingShingle | T Technology (General) Lian, Tze Lim Enya, Kong Tang Lay-Ki, Soon Tek, Yong Lim Ranaivo-Malançon, Bali Context-dependent multilingual lexical lookup for under-resourced languages |
| title | Context-dependent multilingual lexical lookup for under-resourced languages |
| title_full | Context-dependent multilingual lexical lookup for under-resourced languages |
| title_fullStr | Context-dependent multilingual lexical lookup for under-resourced languages |
| title_full_unstemmed | Context-dependent multilingual lexical lookup for under-resourced languages |
| title_short | Context-dependent multilingual lexical lookup for under-resourced languages |
| title_sort | context-dependent multilingual lexical lookup for under-resourced languages |
| topic | T Technology (General) |
| url | http://ir.unimas.my/id/eprint/16527/ http://ir.unimas.my/id/eprint/16527/1/Context-dependent%20multilingual%20lexical%20lookup%20for%20under-resourced%20languages%20%28abstrak%29.pdf |