Discovery of latent subcommunities in a blog's readership

The blogosphere has grown to be a mainstream forum of social interaction as well as a commercially attractive source of information and influence. Tools are needed to better understand how communities that adhere to individual blogs are constituted in order to facilitate new personal, socially-focus...

Bibliographic Details
Main Authors: Adams, Brett, Phung, Dinh, Venkatesh, Svetha
Format: Journal Article
Published: ACM 2010
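Subjects: Web communities; affinity propagation; Algorithms; Content Analysis and Indexing; Multimedia Information Systems; topic models; Blog; Human Factor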
Online Access: http://hdl.handle.net/20.500.11937/14244
_version_ 1848748571473477632
author Adams, Brett
Phung, Dinh
Venkatesh, Svetha
author_facet Adams, Brett
Phung, Dinh
Venkatesh, Svetha
author_sort Adams, Brett
building Curtin Institutional Repository
collection Online Access
description The blogosphere has grown to be a mainstream forum of social interaction as well as a commercially attractive source of information and influence. Tools are needed to better understand how communities that adhere to individual blogs are constituted in order to facilitate new personal, socially-focused browsing paradigms, and understand how blog content is consumed, which is of interest to blog authors, big media, and search. We present a novel approach to blog subcommunity characterization by modeling individual blog readers using mixtures of an extension to the LDA family that jointly models phrases and time, Ngram Topic over Time (NTOT), and cluster with a number of similarity measures using Affinity Propagation. We experiment with two datasets: a small set of blogs whose authors provide feedback, and a set of popular, highly commented blogs, which provide indicators of algorithm scalability and interpretability without prior knowledge of a given blog. The results offer useful insight to the blog authors about their commenting community, and are observed to offer an integrated perspective on the topics of discussion and members engaged in those discussions for unfamiliar blogs. Our approach also holds promise as a component of solutions to related problems, such as online entity resolution and role discovery.
first_indexed 2025-11-14T07:07:10Z
format Journal Article
id curtin-20.500.11937-14244
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T07:07:10Z
publishDate 2010
publisher ACM
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-14244 2017-09-13T15:58:10Z Discovery of latent subcommunities in a blog's readership Adams, Brett Phung, Dinh Venkatesh, Svetha Web communities affinity propagation Algorithms Content Analysis and Indexing Multimedia Information Systems topic models Blog Human Factor The blogosphere has grown to be a mainstream forum of social interaction as well as a commercially attractive source of information and influence. Tools are needed to better understand how communities that adhere to individual blogs are constituted in order to facilitate new personal, socially-focused browsing paradigms, and understand how blog content is consumed, which is of interest to blog authors, big media, and search. We present a novel approach to blog subcommunity characterization by modeling individual blog readers using mixtures of an extension to the LDA family that jointly models phrases and time, Ngram Topic over Time (NTOT), and cluster with a number of similarity measures using Affinity Propagation. We experiment with two datasets: a small set of blogs whose authors provide feedback, and a set of popular, highly commented blogs, which provide indicators of algorithm scalability and interpretability without prior knowledge of a given blog. The results offer useful insight to the blog authors about their commenting community, and are observed to offer an integrated perspective on the topics of discussion and members engaged in those discussions for unfamiliar blogs. Our approach also holds promise as a component of solutions to related problems, such as online entity resolution and role discovery. 2010 Journal Article http://hdl.handle.net/20.500.11937/14244 10.1145/1806916.1806921 ACM restricted
spellingShingle Web communities
affinity propagation
Algorithms
Content Analysis and Indexing
Multimedia Information Systems
topic models
Blog
Human Factor
Adams, Brett
Phung, Dinh
Venkatesh, Svetha
Discovery of latent subcommunities in a blog's readership
title Discovery of latent subcommunities in a blog's readership
title_full Discovery of latent subcommunities in a blog's readership
title_fullStr Discovery of latent subcommunities in a blog's readership
title_full_unstemmed Discovery of latent subcommunities in a blog's readership
title_short Discovery of latent subcommunities in a blog's readership
title_sort discovery of latent subcommunities in a blog's readership
topic Web communities
affinity propagation
Algorithms
Content Analysis and Indexing
Multimedia Information Systems
topic models
Blog
Human Factor
url http://hdl.handle.net/20.500.11937/14244