Text segmentation for analysing different languages
Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, assuming with its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase a...
| Main Authors: | , |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: |
2016
|
| Subjects: | |
| Online Access: | http://eprints.sunway.edu.my/841/ http://eprints.sunway.edu.my/841/1/Teh%20Phoey%20Lee%20conf.pdf |
| _version_ | 1848801908702052352 |
|---|---|
| author | Pak, Irina * Teh, Phoey Lee * |
| author_facet | Pak, Irina * Teh, Phoey Lee * |
| author_sort | Pak, Irina * |
| building | SU Institutional Repository |
| collection | Online Access |
| description | Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, assuming with its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase and any information unit. Firstly, this study reviews the different types of text segmentation methods used in different types of documentation, and later discusses the various reasons for utilizing it in opinion mining. The main contribution of this study includes a summarisation of research papers from the past 10 years that applied text segmentation as their main approach in text analysing. Results show that word segmentation was successfully and widely used for processing different languages. |
| first_indexed | 2025-11-14T21:14:56Z |
| format | Conference or Workshop Item |
| id | sunway-841 |
| institution | Sunway University |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-14T21:14:56Z |
| publishDate | 2016 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | sunway-8412019-07-23T01:41:14Z http://eprints.sunway.edu.my/841/ Text segmentation for analysing different languages Pak, Irina * Teh, Phoey Lee * QA76 Computer software Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, assuming with its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase and any information unit. Firstly, this study reviews the different types of text segmentation methods used in different types of documentation, and later discusses the various reasons for utilizing it in opinion mining. The main contribution of this study includes a summarisation of research papers from the past 10 years that applied text segmentation as their main approach in text analysing. Results show that word segmentation was successfully and widely used for processing different languages. 2016-11-11 Conference or Workshop Item PeerReviewed text en http://eprints.sunway.edu.my/841/1/Teh%20Phoey%20Lee%20conf.pdf Pak, Irina * and Teh, Phoey Lee * (2016) Text segmentation for analysing different languages. In: COMSE 2016 - First EAI International Conference on Computer Science and Engineering, 11-12 November 2016, Penang, Malaysia. http://compse-conf.org/2016/show/home |
| spellingShingle | QA76 Computer software Pak, Irina * Teh, Phoey Lee * Text segmentation for analysing different languages |
| title | Text segmentation for analysing different languages |
| title_full | Text segmentation for analysing different languages |
| title_fullStr | Text segmentation for analysing different languages |
| title_full_unstemmed | Text segmentation for analysing different languages |
| title_short | Text segmentation for analysing different languages |
| title_sort | text segmentation for analysing different languages |
| topic | QA76 Computer software |
| url | http://eprints.sunway.edu.my/841/ http://eprints.sunway.edu.my/841/ http://eprints.sunway.edu.my/841/1/Teh%20Phoey%20Lee%20conf.pdf |