APSCAN: A parameter free algorithm for clustering

DBSCAN is a density based clustering algorithm and its effectiveness for spatial datasets has been demonstrated in the existing literature. However, there are two distinct drawbacks for DBSCAN: (i) the performances of clustering depend on two specified parameters. One is the maximum radius of a neig...

Full description

Bibliographic Details
Main Authors: Chen, Xiaoming, Liu, Wan-quan, Huining, Q., Lai, J.
Format: Journal Article
Published: Elsevier BV, North-Holland 2011
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/26657
_version_ 1848752048767500288
author Chen, Xiaoming
Liu, Wan-quan
Huining, Q.
Lai, J.
author_facet Chen, Xiaoming
Liu, Wan-quan
Huining, Q.
Lai, J.
author_sort Chen, Xiaoming
building Curtin Institutional Repository
collection Online Access
description DBSCAN is a density based clustering algorithm and its effectiveness for spatial datasets has been demonstrated in the existing literature. However, there are two distinct drawbacks for DBSCAN: (i) the performances of clustering depend on two specified parameters. One is the maximum radius of a neighborhood and the other is the minimum number of the data points contained in such neighborhood. In fact these two specified parameters define a single density. Nevertheless, without enough prior knowledge, these two parameters are difficult to be determined; (ii) with these two parameters for a single density, DBSCAN does not perform well to datasets with varying densities. The above two issues bring some difficulties in applications. To address these two problems in a systematic way, in this paper we propose a novel parameter free clustering algorithm named as APSCAN. Firstly, we utilize the Affinity Propagation (AP) algorithm to detect local densities for a dataset and generate a normalized density list. Secondly, we combine the first pair of density parameters with any other pair of density parameters in the normalized density list as input parameters for a proposed DDBSCAN (Double-Density-Based SCAN) to produce a set of clustering results. In this way, we can obtain different clustering results with varying density parameters derived from the normalized density list. Thirdly, we develop an updated rule for the results obtained by implementing the DDBSCAN with different input parameters and then synthesize these clustering results into a final result. The proposed APSCAN has two advantages: first it does not need to predefine the two parameters as required in DBSCAN and second, it not only can cluster datasets with varying densities but also preserve the nonlinear data structure for such datasets.
first_indexed 2025-11-14T08:02:26Z
format Journal Article
id curtin-20.500.11937-26657
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T08:02:26Z
publishDate 2011
publisher Elsevier BV, North-Holland
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-266572017-09-13T16:09:10Z APSCAN: A parameter free algorithm for clustering Chen, Xiaoming Liu, Wan-quan Huining, Q. Lai, J. DBSCAN Clustering algorithm Affinity propagation algorithm DBSCAN is a density based clustering algorithm and its effectiveness for spatial datasets has been demonstrated in the existing literature. However, there are two distinct drawbacks for DBSCAN: (i) the performances of clustering depend on two specified parameters. One is the maximum radius of a neighborhood and the other is the minimum number of the data points contained in such neighborhood. In fact these two specified parameters define a single density. Nevertheless, without enough prior knowledge, these two parameters are difficult to be determined; (ii) with these two parameters for a single density, DBSCAN does not perform well to datasets with varying densities. The above two issues bring some difficulties in applications. To address these two problems in a systematic way, in this paper we propose a novel parameter free clustering algorithm named as APSCAN. Firstly, we utilize the Affinity Propagation (AP) algorithm to detect local densities for a dataset and generate a normalized density list. Secondly, we combine the first pair of density parameters with any other pair of density parameters in the normalized density list as input parameters for a proposed DDBSCAN (Double-Density-Based SCAN) to produce a set of clustering results. In this way, we can obtain different clustering results with varying density parameters derived from the normalized density list. Thirdly, we develop an updated rule for the results obtained by implementing the DDBSCAN with different input parameters and then synthesize these clustering results into a final result. The proposed APSCAN has two advantages: first it does not need to predefine the two parameters as required in DBSCAN and second, it not only can cluster datasets with varying densities but also preserve the nonlinear data structure for such datasets. 2011 Journal Article http://hdl.handle.net/20.500.11937/26657 10.1016/j.patrec.2011.02.001 Elsevier BV, North-Holland restricted
spellingShingle DBSCAN
Clustering algorithm
Affinity propagation algorithm
Chen, Xiaoming
Liu, Wan-quan
Huining, Q.
Lai, J.
APSCAN: A parameter free algorithm for clustering
title APSCAN: A parameter free algorithm for clustering
title_full APSCAN: A parameter free algorithm for clustering
title_fullStr APSCAN: A parameter free algorithm for clustering
title_full_unstemmed APSCAN: A parameter free algorithm for clustering
title_short APSCAN: A parameter free algorithm for clustering
title_sort apscan: a parameter free algorithm for clustering
topic DBSCAN
Clustering algorithm
Affinity propagation algorithm
url http://hdl.handle.net/20.500.11937/26657