A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization

The Evolving tree (ETree) is a hierarchical clustering and visualization model that allows the number of clusters to grow and evolve with new data samples in an online learning manner. While many hierarchical clustering models are available in the literature, ETree stands out because of its visualiz...

Full description

Bibliographic Details
Main Authors: Chang, Wuilee, Tay, Kai Meng, Lim, Cheepeng
Format: Article
Language:English
Published: Springer New York LLC 2017
Subjects:
Online Access:http://ir.unimas.my/id/eprint/15417/
http://ir.unimas.my/id/eprint/15417/7/A%20New%20Evolving%20Tree-Based%20Model%20with%20Local%20%28abstract%29.pdf
_version_ 1848837848413765632
author Chang, Wuilee
Tay, Kai Meng
Lim, Cheepeng
author_facet Chang, Wuilee
Tay, Kai Meng
Lim, Cheepeng
author_sort Chang, Wuilee
building UNIMAS Institutional Repository
collection Online Access
description The Evolving tree (ETree) is a hierarchical clustering and visualization model that allows the number of clusters to grow and evolve with new data samples in an online learning manner. While many hierarchical clustering models are available in the literature, ETree stands out because of its visualization capability. It is an enhancement of the Self-Organizing Map, a famous and useful clustering and visualization model. ETree organises the trained data samples in the form of a tree structure for better presentation and visualization especially for high-dimensional data samples. Even though ETree has been used in a number of applications, its use in textual document clustering and visualization is limited. In this paper, ETree is modified and deployed as a useful model for undertaking textual documents clustering and visualization problems. We introduce a new local re-learning procedure that allows the tree structure to grow and adapt to new features, i.e., new words from new textual documents. The performance of the proposed ETree model is evaluated with two (one benchmark and one real) document data sets. A number of key aspects of the proposed ETree model, which include its topology representation, learning time, as well as recall and precision rates, are evaluated. The results show that the proposed local re-learning procedure is useful for handling increasing number of features incrementally. In summary, this study contributes towards a modified ETree model and its use in a new domain, i.e., textual document clustering and visualization.
first_indexed 2025-11-15T06:46:11Z
format Article
id unimas-15417
institution Universiti Malaysia Sarawak
institution_category Local University
language English
last_indexed 2025-11-15T06:46:11Z
publishDate 2017
publisher Springer New York LLC
recordtype eprints
repository_type Digital Repository
spelling unimas-154172017-04-12T02:40:22Z http://ir.unimas.my/id/eprint/15417/ A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization Chang, Wuilee Tay, Kai Meng Lim, Cheepeng QA Mathematics QA76 Computer software The Evolving tree (ETree) is a hierarchical clustering and visualization model that allows the number of clusters to grow and evolve with new data samples in an online learning manner. While many hierarchical clustering models are available in the literature, ETree stands out because of its visualization capability. It is an enhancement of the Self-Organizing Map, a famous and useful clustering and visualization model. ETree organises the trained data samples in the form of a tree structure for better presentation and visualization especially for high-dimensional data samples. Even though ETree has been used in a number of applications, its use in textual document clustering and visualization is limited. In this paper, ETree is modified and deployed as a useful model for undertaking textual documents clustering and visualization problems. We introduce a new local re-learning procedure that allows the tree structure to grow and adapt to new features, i.e., new words from new textual documents. The performance of the proposed ETree model is evaluated with two (one benchmark and one real) document data sets. A number of key aspects of the proposed ETree model, which include its topology representation, learning time, as well as recall and precision rates, are evaluated. The results show that the proposed local re-learning procedure is useful for handling increasing number of features incrementally. In summary, this study contributes towards a modified ETree model and its use in a new domain, i.e., textual document clustering and visualization. Springer New York LLC 2017-02-06 Article PeerReviewed text en http://ir.unimas.my/id/eprint/15417/7/A%20New%20Evolving%20Tree-Based%20Model%20with%20Local%20%28abstract%29.pdf Chang, Wuilee and Tay, Kai Meng and Lim, Cheepeng (2017) A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization. Neural Processing Letters. pp. 1-31. ISSN 13704621 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85011685221&doi=10.1007%2fs11063-017-9597-3&partnerID=40&md5=513265abde740ea4309f1d20d99e81e0 DOI: 10.1007/s11063-017-9597-3
spellingShingle QA Mathematics
QA76 Computer software
Chang, Wuilee
Tay, Kai Meng
Lim, Cheepeng
A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title_full A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title_fullStr A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title_full_unstemmed A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title_short A New Evolving Tree-Based Model with Local Re-learning for Document Clustering and Visualization
title_sort new evolving tree-based model with local re-learning for document clustering and visualization
topic QA Mathematics
QA76 Computer software
url http://ir.unimas.my/id/eprint/15417/
http://ir.unimas.my/id/eprint/15417/
http://ir.unimas.my/id/eprint/15417/
http://ir.unimas.my/id/eprint/15417/7/A%20New%20Evolving%20Tree-Based%20Model%20with%20Local%20%28abstract%29.pdf