ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Detecting outliers in categorical data through rough clustering

Suri, Ranga NNR and Murty, Narasimha M and Athithan, G (2016) Detecting outliers in categorical data through rough clustering. In: NATURAL COMPUTING, 15 (3, SI). pp. 385-394.

[img] PDF
Nat_Com_15-3_385_2016.pdf - Published Version
Restricted to Registered users only

Download (840kB) | Request a copy
Official URL: http://dx.doi.org/10.1007/s11047-015-9489-2


Outlier detection is an important data mining task with many contemporary applications. Clustering based methods for outlier detection try to identify the data objects that deviate from the normal data. However, the uncertainty regarding the cluster membership of an outlier object has to be handled appropriately during the clustering process. Additionally, carrying out the clustering process on data described using categorical attributes is challenging, due to the difficulty in defining requisite methods and measures dealing with such data. Addressing these issues, a novel algorithm for clustering categorical data aimed at outlier detection is proposed here by modifying the standard -modes algorithm. The uncertainty regarding the clustering process is addressed by considering a soft computing approach based on rough sets. Accordingly, the modified clustering algorithm incorporates the lower and upper approximation properties of rough sets. The efficacy of the proposed rough -modes clustering algorithm for outlier detection is demonstrated using various benchmark categorical data sets.

Item Type: Journal Article
Additional Information: Copy right for this article belongs to the SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Department/Centre: Division of Electrical Sciences > Computer Science & Automation
Date Deposited: 28 Oct 2016 07:20
Last Modified: 28 Oct 2016 07:20
URI: http://eprints.iisc.ac.in/id/eprint/55157

Actions (login required)

View Item View Item