Clustering Patient Medical Records via Sparse Subspace Representation

The health industry is facing increasing challenge with “big data” as traditional methods fail to manage the scale and complexity. This paper examines clustering of patient records for chronic diseases to facilitate a better construction of care plans. We solve this problem under the framework of su...

Full description

Bibliographic Details
Main Authors: Budhaditya, S., Phung, D., Pham, DucSon, Venkatesh, S.
Other Authors: Pei, J.
Format: Conference Paper
Published: Springer 2013
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/46203
Description
Summary:The health industry is facing increasing challenge with “big data” as traditional methods fail to manage the scale and complexity. This paper examines clustering of patient records for chronic diseases to facilitate a better construction of care plans. We solve this problem under the framework of subspace clustering. Our novel contribution lies in the exploitation of sparse representation to discover subspaces automatically and a domain-specific construction of weighting matrices for patient records. We show the new formulation is readily solved by extending existing 1 -regularized optimization algorithms. Using a cohort of both diabetes and stroke data we show that we outperform existing benchmark clustering techniques in the literature.