An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets

The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a pre-defined number of clusters. However,...

Full description

Bibliographic Details
Main Authors: Azlin, Ahmad, Rubiyah, Yusof, Nor Saradatul Akmar, Zulkifli, Mohd Najib, Ismail
Format: Article
Language:English
Published: Universiti Utara Malaysia Press 2021
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/32664/
http://umpir.ump.edu.my/id/eprint/32664/1/An%20improved%20pheromone-based%20kohonen%20self-%20organising%20map.pdf
_version_ 1848824079845425152
author Azlin, Ahmad
Rubiyah, Yusof
Nor Saradatul Akmar, Zulkifli
Mohd Najib, Ismail
author_facet Azlin, Ahmad
Rubiyah, Yusof
Nor Saradatul Akmar, Zulkifli
Mohd Najib, Ismail
author_sort Azlin, Ahmad
building UMP Institutional Repository
collection Online Access
description The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a pre-defined number of clusters. However, similar to other clustering algorithms, this algorithm requires sufficient data for its unsupervised learning process. The inadequate amount of class label data in a dataset significantly affects the clustering learning process, leading to inefficient and unreliable results. Numerous research have been conducted by hybridising and optimising the KSOM algorithm with various optimisation techniques. Unfortunately, the problems are still unsolved, especially separation boundary and overlapping clusters. Therefore, this research proposed an improved pheromone- based PKSOM algorithm known as iPKSOM to solve the mentioned problem. Six different datasets, i.e. Iris, Seed, Glass, Titanic, WDBC, and Tropical Wood datasets were chosen to investigate the effectiveness of the iPKSOM algorithm. All datasets were observed and compared with the original KSOM results. This modification significantly impacted the clustering process by improving and refining the scatteredness of clustering data and reducing overlapping clusters. Therefore, this proposed algorithm can be implemented in clustering other complex datasets, such as high dimensional and streaming data.
first_indexed 2025-11-15T03:07:20Z
format Article
id ump-32664
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T03:07:20Z
publishDate 2021
publisher Universiti Utara Malaysia Press
recordtype eprints
repository_type Digital Repository
spelling ump-326642022-01-06T04:23:40Z http://umpir.ump.edu.my/id/eprint/32664/ An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets Azlin, Ahmad Rubiyah, Yusof Nor Saradatul Akmar, Zulkifli Mohd Najib, Ismail QA76 Computer software The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a pre-defined number of clusters. However, similar to other clustering algorithms, this algorithm requires sufficient data for its unsupervised learning process. The inadequate amount of class label data in a dataset significantly affects the clustering learning process, leading to inefficient and unreliable results. Numerous research have been conducted by hybridising and optimising the KSOM algorithm with various optimisation techniques. Unfortunately, the problems are still unsolved, especially separation boundary and overlapping clusters. Therefore, this research proposed an improved pheromone- based PKSOM algorithm known as iPKSOM to solve the mentioned problem. Six different datasets, i.e. Iris, Seed, Glass, Titanic, WDBC, and Tropical Wood datasets were chosen to investigate the effectiveness of the iPKSOM algorithm. All datasets were observed and compared with the original KSOM results. This modification significantly impacted the clustering process by improving and refining the scatteredness of clustering data and reducing overlapping clusters. Therefore, this proposed algorithm can be implemented in clustering other complex datasets, such as high dimensional and streaming data. Universiti Utara Malaysia Press 2021-10 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/32664/1/An%20improved%20pheromone-based%20kohonen%20self-%20organising%20map.pdf Azlin, Ahmad and Rubiyah, Yusof and Nor Saradatul Akmar, Zulkifli and Mohd Najib, Ismail (2021) An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets. Journal of Information and Communication Technology, 20 (4). 651 -676. ISSN 1675-414X (Print); 2180-3862 (Online). (Published) https://doi.org/10.32890/jict2021.20.4.8 https://doi.org/10.32890/jict2021.20.4.8
spellingShingle QA76 Computer software
Azlin, Ahmad
Rubiyah, Yusof
Nor Saradatul Akmar, Zulkifli
Mohd Najib, Ismail
An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title_full An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title_fullStr An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title_full_unstemmed An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title_short An improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
title_sort improved pheromone-based kohonen self-organising map in clustering and visualising balanced and imbalanced datasets
topic QA76 Computer software
url http://umpir.ump.edu.my/id/eprint/32664/
http://umpir.ump.edu.my/id/eprint/32664/
http://umpir.ump.edu.my/id/eprint/32664/
http://umpir.ump.edu.my/id/eprint/32664/1/An%20improved%20pheromone-based%20kohonen%20self-%20organising%20map.pdf