Text segmentation for analysing different languages

Over the past several years, researchers have applied different methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, each assumed to carry its own relevant meaning. Those segments can be classified into the tag, word, sentence, topic, phrase a...

Bibliographic Details
Main Authors: Pak, Irina *, Teh, Phoey Lee *
Format: Conference or Workshop Item
Language: English
Published: 2016
Subjects: QA76 Computer software
Online Access: http://eprints.sunway.edu.my/841/
http://eprints.sunway.edu.my/841/1/Teh%20Phoey%20Lee%20conf.pdf
building SU Institutional Repository
collection Online Access
description Over the past several years, researchers have applied various methods of text segmentation. Text segmentation is defined as a method of splitting a document into smaller segments, each assumed to carry its own relevant meaning. These segments can be classified as tags, words, sentences, topics, phrases, or any other information unit. This study first reviews the types of text segmentation methods used in different types of documentation, and then discusses the reasons for using text segmentation in opinion mining. The main contribution of this study is a summary of research papers from the past 10 years that applied text segmentation as their main approach to text analysis. The results show that word segmentation has been used widely and successfully for processing different languages.
id sunway-841
institution Sunway University
institution_category Local University
recordtype eprints
repository_type Digital Repository
Pak, Irina * and Teh, Phoey Lee * (2016) Text segmentation for analysing different languages. In: COMSE 2016 - First EAI International Conference on Computer Science and Engineering, 11-12 November 2016, Penang, Malaysia. http://compse-conf.org/2016/show/home (PeerReviewed, 2016-11-11)
topic QA76 Computer software