Ranga Suri, NNR and Murty M, N and Athithan, G (2019) Outlier detection in categorical data. [Book Chapter]
PDF
int_sys_ref_lib_155_69-93_2019.pdf - Published Version Restricted to Registered users only Download (485kB) | Request a copy |
Abstract
This chapter delves on a specific research issue connected with outlier detection problem, namely type of data attributes. More specifically, the case of analyzing data described using categorical attributes/features is presented here. It is known that the performance of a detection algorithm directly depends on the way outliers are perceived. Typically, categorical data are processed by considering the occurrence frequencies of various attributes values. Accordingly, the objective here is to characterize the deviating nature of data objects with respect to individual attributes as well as in the joint distribution of two or more attributes. This can be achieved by defining the measure of deviation in terms of the attribute value frequencies. Also, cluster analysis provides valuable insights on the inherent grouping structure of the data that helps in identifying the deviating objects. Based on this understanding, this chapter presents algorithms developed for detection of outliers in categorical data. © Springer Nature Switzerland AG 2019.
Item Type: | Book Chapter |
---|---|
Publication: | Intelligent Systems Reference Library |
Publisher: | Springer Science and Business Media Deutschland GmbH |
Additional Information: | The copyright for this article belongs to Springer Science and Business Media Deutschland GmbH. |
Department/Centre: | Division of Electrical Sciences > Computer Science & Automation |
Date Deposited: | 28 Nov 2022 09:33 |
Last Modified: | 28 Nov 2022 09:33 |
URI: | https://eprints.iisc.ac.in/id/eprint/78004 |
Actions (login required)
View Item |