Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation

Communication over the Internet becomes the necessity of life. Multi-lingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach which requires a large set of examples as reference. These examples are prepare...

Full description

Bibliographic Details
Main Author: Ng , Pek Kuan
Format: Thesis
Language:English
Published: 2012
Subjects:
Online Access:http://eprints.usm.my/42140/
http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf
_version_ 1848879479509745664
author Ng , Pek Kuan
author_facet Ng , Pek Kuan
author_sort Ng , Pek Kuan
building USM Institutional Repository
collection Online Access
description Communication over the Internet becomes the necessity of life. Multi-lingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach which requires a large set of examples as reference. These examples are prepared by aligning the parallel texts either manually or semi-automatically with human intervention. This requires much effort and is time-consuming considering the large number of examples needed to ensure the quality of the translation. Moreover, the fact that humans make mistakes and has preferences raises the consistency issue. Hence, there is an urgent need to develop an automatic aligner.
first_indexed 2025-11-15T17:47:53Z
format Thesis
id usm-42140
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T17:47:53Z
publishDate 2012
recordtype eprints
repository_type Digital Repository
spelling usm-421402019-04-12T05:26:23Z http://eprints.usm.my/42140/ Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation Ng , Pek Kuan QA75.5-76.95 Electronic computers. Computer science Communication over the Internet becomes the necessity of life. Multi-lingual machine translation systems are developed to support such communication. One of the most commonly used approaches is the example-based approach which requires a large set of examples as reference. These examples are prepared by aligning the parallel texts either manually or semi-automatically with human intervention. This requires much effort and is time-consuming considering the large number of examples needed to ensure the quality of the translation. Moreover, the fact that humans make mistakes and has preferences raises the consistency issue. Hence, there is an urgent need to develop an automatic aligner. 2012-03 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf Ng , Pek Kuan (2012) Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation. Masters thesis, Universiti Sains Malaysia.
spellingShingle QA75.5-76.95 Electronic computers. Computer science
Ng , Pek Kuan
Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title_full Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title_fullStr Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title_full_unstemmed Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title_short Automatic Text Alignment Using Recursive Hapax-Based Cut-Through Fragmentation
title_sort automatic text alignment using recursive hapax-based cut-through fragmentation
topic QA75.5-76.95 Electronic computers. Computer science
url http://eprints.usm.my/42140/
http://eprints.usm.my/42140/1/NG_PEK_KUAN.pdf