Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
Communication over the Internet has become a necessity of life. Multilingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach, which requires a large set of examples as reference. These examples are prepare...
| Main Author: | Ng, Pek Kuan |
|---|---|
| Format: | Thesis |
| Language: | English |
| Published: | 2012 |
| Subjects: | QA75.5-76.95 Electronic computers. Computer science |
| Online Access: | http://eprints.usm.my/42140/ http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf |
| _version_ | 1848879479509745664 |
|---|---|
| author | Ng, Pek Kuan |
| building | USM Institutional Repository |
| collection | Online Access |
| description | Communication over the Internet has become a necessity of life. Multilingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach, which requires a large set of examples as reference. These examples are prepared by aligning parallel texts either manually or semi-automatically with human intervention. This requires much effort and is time-consuming, given the large number of examples needed to ensure translation quality. Moreover, because humans make mistakes and have preferences, consistency becomes an issue. Hence, there is an urgent need to develop an automatic aligner. |
| first_indexed | 2025-11-15T17:47:53Z |
| format | Thesis |
| id | usm-42140 |
| institution | Universiti Sains Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T17:47:53Z |
| publishDate | 2012 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | usm-42140 2019-04-12T05:26:23Z http://eprints.usm.my/42140/ Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation Ng, Pek Kuan QA75.5-76.95 Electronic computers. Computer science Communication over the Internet has become a necessity of life. Multilingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach, which requires a large set of examples as reference. These examples are prepared by aligning parallel texts either manually or semi-automatically with human intervention. This requires much effort and is time-consuming, given the large number of examples needed to ensure translation quality. Moreover, because humans make mistakes and have preferences, consistency becomes an issue. Hence, there is an urgent need to develop an automatic aligner. 2012-03 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf Ng, Pek Kuan (2012) Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation. Master's thesis, Universiti Sains Malaysia. |
| spellingShingle | QA75.5-76.95 Electronic computers. Computer science Ng, Pek Kuan Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation |
| title | Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation |
| topic | QA75.5-76.95 Electronic computers. Computer science |
| url | http://eprints.usm.my/42140/ http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf |