Data clustering using variable precision rough set

Clustering a set of objects into homogeneous classes is a fundamental operation in data mining. Several cluster analysis techniques have been developed to group objects having similar characteristics. Recently, many attentions have been put on categorical data clustering, where data objects are made...

Full description

Bibliographic Details
Main Authors: Yanto, Iwan Tri Riyadi, Herawan, Tutut, Mat Deris, Mustafa
Format: Article
Published: IOS Press 2011
Subjects:
Online Access:http://eprints.uthm.edu.my/3582/
Description
Summary:Clustering a set of objects into homogeneous classes is a fundamental operation in data mining. Several cluster analysis techniques have been developed to group objects having similar characteristics. Recently, many attentions have been put on categorical data clustering, where data objects are made up of non-numerical attributes. An algorithm termed MMR using classical rough set theory was proposed to deal with problems in clustering categorical data. However, the MMR algorithm fails to handle noisy data as an integral part of databases. In this paper, an alternative technique for clustering noisy categorical data using Variable Precision Rough Set model is proposed. The results show that the technique provides better performance in selecting the clustering attribute.