ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Data Clustering: A Review

Jain, AK and Murty, MN and Flynn, PJ (1999) Data Clustering: A Review. In: ACM Computing Surveys, 31 (3). pp. 264-323.

[img]
Preview
PDF
p264-jain.pdf

Download (678kB)

Abstract

Clustering is the unsupervised classification of patterns(observations, data items,or feature vectors) into groups (clusters).The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially,and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur.This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective,with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners. We present a taxonomy of clustering techniques, and identify cross cutting themes and recent advances. We also describe some important applications of clustering algorithms such as image segmentation, object recognition, and information retrieval.

Item Type: Journal Article
Publication: ACM Computing Surveys
Publisher: ACM Press
Additional Information: ©ACM,1999.This is the author's version of the work.It is posted here by permission of ACM for your personal use. Not for redistribution.The definitive version was published in ACM Computing Surveys, VOL 31, ISS 3, September 1999 http://doi.acm.org/10.1145/331499.331504
Keywords: Algorithms;Cluster analysis;clustering applications;exploratory data analysis;incremental clustering;similarity indices;unsupervised learning
Department/Centre: Division of Electrical Sciences > Computer Science & Automation
Date Deposited: 08 Jun 2004
Last Modified: 19 Sep 2010 04:12
URI: http://eprints.iisc.ac.in/id/eprint/273

Actions (login required)

View Item View Item