A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study

Machine learning techniques have been mostly applied in gene expression cancer data. Socio-demographic data available in cancer registries could be explored, to get further insight into relationships between cancer types and their contributing factors. Moreover, less attention has been paid to analy...

Full description

Bibliographic Details
Main Authors: Lai, Daphne Teck Ching, Owais A. Malik
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2022
Online Access:http://journalarticle.ukm.my/19427/
http://journalarticle.ukm.my/19427/1/05.pdf
_version_ 1848814838149545984
author Lai, Daphne Teck Ching
Owais A. Malik,
author_facet Lai, Daphne Teck Ching
Owais A. Malik,
author_sort Lai, Daphne Teck Ching
building UKM Institutional Repository
collection Online Access
description Machine learning techniques have been mostly applied in gene expression cancer data. Socio-demographic data available in cancer registries could be explored, to get further insight into relationships between cancer types and their contributing factors. Moreover, less attention has been paid to analyse the mixed demographic data (numeric and categorical) from cancer registries and its association to the cancer types. The aim of this study is to identify subgroups of patients, having similar demographics characteristics, from the population based cancer registry in Brunei Darussalam and examine the prevalent cancer types in these subgroups. Four clustering algorithms are explored in the cluster analysis of Brunei Darussalam Cancer Registry; Two-step, Partitional Around Medoid, Agglomerative Hierarchical and Model-based. Gower distance was used for measuring similarity for mixed data types. To evaluate the clusters found; cluster distribution and Silhouette index were used for cluster quality, Cohen's Kappa Index for cluster stability and Cramer's V Coefficient for clinical relevance of clusters. Six distinct demographic subgroups were consistently found by three algorithms while model-based clustering solution were not considered for deeper analysis as highly imbalanced clusters were produced. The subgroups found have good quality clusters, moderate association with cancer types and high stability. The top three prevalent cancers associated with these subgroups were consistently identified using the three algorithms. Upon comparing the subgroups’ ages during diagnosis, we identify possible screening behaviours of specific subgroups, suggesting for early screening awareness programmes. This study demonstrates the use of cluster analysis in a cancer registry to identify demographic subgroups that could suggest potential areas to develop targeted and improved healthcare management strategies.
first_indexed 2025-11-15T00:40:26Z
format Article
id oai:generic.eprints.org:19427
institution Universiti Kebangasaan Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T00:40:26Z
publishDate 2022
publisher Penerbit Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling oai:generic.eprints.org:194272022-08-18T08:13:15Z http://journalarticle.ukm.my/19427/ A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study Lai, Daphne Teck Ching Owais A. Malik, Machine learning techniques have been mostly applied in gene expression cancer data. Socio-demographic data available in cancer registries could be explored, to get further insight into relationships between cancer types and their contributing factors. Moreover, less attention has been paid to analyse the mixed demographic data (numeric and categorical) from cancer registries and its association to the cancer types. The aim of this study is to identify subgroups of patients, having similar demographics characteristics, from the population based cancer registry in Brunei Darussalam and examine the prevalent cancer types in these subgroups. Four clustering algorithms are explored in the cluster analysis of Brunei Darussalam Cancer Registry; Two-step, Partitional Around Medoid, Agglomerative Hierarchical and Model-based. Gower distance was used for measuring similarity for mixed data types. To evaluate the clusters found; cluster distribution and Silhouette index were used for cluster quality, Cohen's Kappa Index for cluster stability and Cramer's V Coefficient for clinical relevance of clusters. Six distinct demographic subgroups were consistently found by three algorithms while model-based clustering solution were not considered for deeper analysis as highly imbalanced clusters were produced. The subgroups found have good quality clusters, moderate association with cancer types and high stability. The top three prevalent cancers associated with these subgroups were consistently identified using the three algorithms. Upon comparing the subgroups’ ages during diagnosis, we identify possible screening behaviours of specific subgroups, suggesting for early screening awareness programmes. This study demonstrates the use of cluster analysis in a cancer registry to identify demographic subgroups that could suggest potential areas to develop targeted and improved healthcare management strategies. Penerbit Universiti Kebangsaan Malaysia 2022-06 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/19427/1/05.pdf Lai, Daphne Teck Ching and Owais A. Malik, (2022) A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study. Asia-Pacific Journal of Information Technology and Multimedia, 11 (1). pp. 54-64. ISSN 2289-2192 https://www.ukm.my/apjitm/articles-issues
spellingShingle Lai, Daphne Teck Ching
Owais A. Malik,
A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title_full A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title_fullStr A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title_full_unstemmed A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title_short A cluster analysis of population based cancer registry in Brunei Darussalam : an exploratory study
title_sort cluster analysis of population based cancer registry in brunei darussalam : an exploratory study
url http://journalarticle.ukm.my/19427/
http://journalarticle.ukm.my/19427/
http://journalarticle.ukm.my/19427/1/05.pdf