Classification and pattern discovery of mood in weblogs

Automatic data-driven analysis of mood from text is anemerging problem with many potential applications. Unlike generic text categorization, mood classification based on textual features is complicated by various factors, including its context- and user-sensitive nature. We present a comprehensive s...

Full description

Bibliographic Details
Main Authors: Nguyen, Thin, Phung, Dinh, Adams, Brett, Tran, Truyen, Venkatesh, Svetha
Other Authors: M. Zaki
Format: Conference Paper
Published: Springer-Verlag 2010
Online Access:http://hdl.handle.net/20.500.11937/47245
_version_ 1848757780987510784
author Nguyen, Thin
Phung, Dinh
Adams, Brett
Tran, Truyen
Venkatesh, Svetha
author2 M. Zaki
author_facet M. Zaki
Nguyen, Thin
Phung, Dinh
Adams, Brett
Tran, Truyen
Venkatesh, Svetha
author_sort Nguyen, Thin
building Curtin Institutional Repository
collection Online Access
description Automatic data-driven analysis of mood from text is anemerging problem with many potential applications. Unlike generic text categorization, mood classification based on textual features is complicated by various factors, including its context- and user-sensitive nature. We present a comprehensive study of different feature selection schemes in machine learning for the problem of mood classification in weblogs. Notably, we introduce the novel use of a feature set based on the affective norms for English words (ANEW) lexicon studied in psychology. This feature set has the advantage of being computationally efficient while maintaining accuracy comparable to other state-of-the-art feature sets experimented with. In addition, we present results of data-driven clustering on a dataset of over 17 million blog posts with mood groundtruth. Our analysis reveals an interesting, and readily interpreted, structure to the linguistic expression of emotion, one that comprises valuable empirical evidence in support of existing psychological models of emotion, and in particular the dipoles pleasure-displeasure and activation-deactivation.
first_indexed 2025-11-14T09:33:32Z
format Conference Paper
id curtin-20.500.11937-47245
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T09:33:32Z
publishDate 2010
publisher Springer-Verlag
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-472452023-01-18T08:46:45Z Classification and pattern discovery of mood in weblogs Nguyen, Thin Phung, Dinh Adams, Brett Tran, Truyen Venkatesh, Svetha M. Zaki J. Yu B. Ravindran V. Pudi Automatic data-driven analysis of mood from text is anemerging problem with many potential applications. Unlike generic text categorization, mood classification based on textual features is complicated by various factors, including its context- and user-sensitive nature. We present a comprehensive study of different feature selection schemes in machine learning for the problem of mood classification in weblogs. Notably, we introduce the novel use of a feature set based on the affective norms for English words (ANEW) lexicon studied in psychology. This feature set has the advantage of being computationally efficient while maintaining accuracy comparable to other state-of-the-art feature sets experimented with. In addition, we present results of data-driven clustering on a dataset of over 17 million blog posts with mood groundtruth. Our analysis reveals an interesting, and readily interpreted, structure to the linguistic expression of emotion, one that comprises valuable empirical evidence in support of existing psychological models of emotion, and in particular the dipoles pleasure-displeasure and activation-deactivation. 2010 Conference Paper http://hdl.handle.net/20.500.11937/47245 Springer-Verlag restricted
spellingShingle Nguyen, Thin
Phung, Dinh
Adams, Brett
Tran, Truyen
Venkatesh, Svetha
Classification and pattern discovery of mood in weblogs
title Classification and pattern discovery of mood in weblogs
title_full Classification and pattern discovery of mood in weblogs
title_fullStr Classification and pattern discovery of mood in weblogs
title_full_unstemmed Classification and pattern discovery of mood in weblogs
title_short Classification and pattern discovery of mood in weblogs
title_sort classification and pattern discovery of mood in weblogs
url http://hdl.handle.net/20.500.11937/47245