Suri, Ranga NNR and Murty, Narasimha M and Athithan, G (2016) Detecting outliers in categorical data through rough clustering. In: NATURAL COMPUTING, 15 (3, SI). pp. 385-394.
PDF
Nat_Com_15-3_385_2016.pdf - Published Version Restricted to Registered users only Download (840kB) | Request a copy |
Abstract
Outlier detection is an important data mining task with many contemporary applications. Clustering based methods for outlier detection try to identify the data objects that deviate from the normal data. However, the uncertainty regarding the cluster membership of an outlier object has to be handled appropriately during the clustering process. Additionally, carrying out the clustering process on data described using categorical attributes is challenging, due to the difficulty in defining requisite methods and measures dealing with such data. Addressing these issues, a novel algorithm for clustering categorical data aimed at outlier detection is proposed here by modifying the standard -modes algorithm. The uncertainty regarding the clustering process is addressed by considering a soft computing approach based on rough sets. Accordingly, the modified clustering algorithm incorporates the lower and upper approximation properties of rough sets. The efficacy of the proposed rough -modes clustering algorithm for outlier detection is demonstrated using various benchmark categorical data sets.
Item Type: | Journal Article |
---|---|
Publication: | NATURAL COMPUTING |
Additional Information: | Copy right for this article belongs to the SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS |
Department/Centre: | Division of Electrical Sciences > Computer Science & Automation |
Date Deposited: | 28 Oct 2016 07:20 |
Last Modified: | 28 Oct 2016 07:20 |
URI: | http://eprints.iisc.ac.in/id/eprint/55157 |
Actions (login required)
View Item |