Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification

Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as le...

Full description

Bibliographic Details
Main Author: Ahmad, N.
Format: Article
Language:English
Published: 2010
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/87/
http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf
_version_ 1848886883920117760
author Ahmad, N.
author_facet Ahmad, N.
author_sort Ahmad, N.
building UTeM Institutional Repository
collection Online Access
description Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited.
first_indexed 2025-11-15T19:45:35Z
format Article
id utem-87
institution Universiti Teknikal Malaysia Melaka
institution_category Local University
language English
last_indexed 2025-11-15T19:45:35Z
publishDate 2010
recordtype eprints
repository_type Digital Repository
spelling utem-872021-09-19T18:04:59Z http://eprints.utem.edu.my/id/eprint/87/ Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification Ahmad, N. Q Science (General) Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited. 2010 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf Ahmad, N. (2010) Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification. Neural Computing and Applications, 19 (4). pp. 531-542. ISSN 0941-0643 http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407
spellingShingle Q Science (General)
Ahmad, N.
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_full Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_fullStr Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_full_unstemmed Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_short Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_sort cluster identification and separation in the growing self-organizing map: application in protein sequence classification
topic Q Science (General)
url http://eprints.utem.edu.my/id/eprint/87/
http://eprints.utem.edu.my/id/eprint/87/
http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf