Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification

Sentiment classification is a useful tool to classify reviews that contain a wealth of information about sentiments and attitudes towards a product or service. Existing studies are heavily relying on sentiment classification methods that require fully annotated input. However, there are limited labe...

Full description

Bibliographic Details
Main Author: Vivian, Lee Lay Shan
Format: Thesis
Language:English
Published: 2022
Subjects:
Online Access:http://eprints.usm.my/60138/
http://eprints.usm.my/60138/1/VIVIAN%20LEE%20LAY%20SHAN%20-%20TESIS24.pdf
_version_ 1848884361921822720
author Vivian, Lee Lay Shan
author_facet Vivian, Lee Lay Shan
author_sort Vivian, Lee Lay Shan
building USM Institutional Repository
collection Online Access
description Sentiment classification is a useful tool to classify reviews that contain a wealth of information about sentiments and attitudes towards a product or service. Existing studies are heavily relying on sentiment classification methods that require fully annotated input. However, there are limited labelled text available, making the acquirement process of the fully annotated input costly and labour intensive. In recent years, semi-supervised methods have been positively recommended as they require only partially labelled input and performed comparably to the current preferred methods. At the same time, there are some works reported the performance of semi-supervised model degraded after adding unlabelled instances into training. The contrast of the current literature shows that not all unlabelled instances are equally useful; thus identifying the informative unlabelled instances is beneficial in training a semi-supervised model. To achieve this, informative score is proposed and incorporated into semi-supervised sentiment classification. The experiment compared the accuracy and loss of supervised method, semi-supervised method without informative score and semi-supervised method with informative score. With the help of informative score to identify informative unlabelled instances, semi-supervised models can perform better compared to semi-supervised models that do not incorporate informative score into its training. Although performance of semi-supervised models incorporated with informative score are not able to surpass the supervised models, the results are still found promising as the differences in performance are subtle and the number of labelled instances used are greatly reduced.
first_indexed 2025-11-15T19:05:29Z
format Thesis
id usm-60138
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T19:05:29Z
publishDate 2022
recordtype eprints
repository_type Digital Repository
spelling usm-601382024-03-12T03:51:25Z http://eprints.usm.my/60138/ Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification Vivian, Lee Lay Shan QA76.9.M35 Computer science -- Mathematics Sentiment classification is a useful tool to classify reviews that contain a wealth of information about sentiments and attitudes towards a product or service. Existing studies are heavily relying on sentiment classification methods that require fully annotated input. However, there are limited labelled text available, making the acquirement process of the fully annotated input costly and labour intensive. In recent years, semi-supervised methods have been positively recommended as they require only partially labelled input and performed comparably to the current preferred methods. At the same time, there are some works reported the performance of semi-supervised model degraded after adding unlabelled instances into training. The contrast of the current literature shows that not all unlabelled instances are equally useful; thus identifying the informative unlabelled instances is beneficial in training a semi-supervised model. To achieve this, informative score is proposed and incorporated into semi-supervised sentiment classification. The experiment compared the accuracy and loss of supervised method, semi-supervised method without informative score and semi-supervised method with informative score. With the help of informative score to identify informative unlabelled instances, semi-supervised models can perform better compared to semi-supervised models that do not incorporate informative score into its training. Although performance of semi-supervised models incorporated with informative score are not able to surpass the supervised models, the results are still found promising as the differences in performance are subtle and the number of labelled instances used are greatly reduced. 2022-05 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/60138/1/VIVIAN%20LEE%20LAY%20SHAN%20-%20TESIS24.pdf Vivian, Lee Lay Shan (2022) Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification. Masters thesis, Universiti Sains Malaysia.
spellingShingle QA76.9.M35 Computer science -- Mathematics
Vivian, Lee Lay Shan
Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title_full Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title_fullStr Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title_full_unstemmed Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title_short Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification
title_sort incorporating informative score for instance selection in semi-supervised sentiment classification
topic QA76.9.M35 Computer science -- Mathematics
url http://eprints.usm.my/60138/
http://eprints.usm.my/60138/1/VIVIAN%20LEE%20LAY%20SHAN%20-%20TESIS24.pdf