
Tree structure for efficient data mining using rough sets

Ananthanarayana, VS and Murty, Narasimha M and Subramanian, DK (2003) Tree structure for efficient data mining using rough sets. In: Pattern Recognition Letters, 24 (6). pp. 851-862.

Official URL: http://dx.doi.org/10.1016/S0167-8655(02)00197-6


In data mining, an important goal is to generate an abstraction of the data. Such an abstraction helps in reducing the space and search time requirements of the overall decision making process. Further, it is important that the abstraction is generated from the data with a small number of disk scans. We propose a novel data structure, pattern count tree (PC-tree), that can be built by scanning the database only once. PC-tree is a minimal size complete representation of the data and it can be used to represent dynamic databases with the help of knowledge that is either static or changing. We show that further compactness can be achieved by constructing the PC-tree on segmented patterns. We exploit the flexibility offered by rough sets to realize a rough PC-tree and use it for efficient and effective rough classification. To be consistent with the sizes of the branches of the PC-tree, we use upper and lower approximations of feature sets in a manner different from the conventional rough set theory. We conducted experiments using the proposed classification scheme on a large-scale hand-written digit data set. We use the experimental results to establish the efficacy of the proposed approach. (C) 2002 Elsevier Science B.V. All rights reserved.
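The abstract does not spell out the node layout of the PC-tree, so the following is only a minimal sketch of the general idea under stated assumptions: each pattern is a set of integer feature indices, features are inserted in sorted order, and every node keeps a count of the patterns passing through it. The names `PCNode`, `build_pc_tree`, `count_nodes`, and `enumerate_patterns` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class PCNode:
    """One node of the sketched tree: a count plus children keyed by feature."""
    count: int = 0
    children: Dict[int, "PCNode"] = field(default_factory=dict)


def build_pc_tree(patterns: Iterable[Iterable[int]]) -> PCNode:
    """Build the tree in a single pass: each pattern is read exactly once."""
    root = PCNode()
    for pattern in patterns:
        node = root
        # Assumption: a canonical (sorted) feature order so that patterns
        # with a common prefix share the same branch.
        for feature in sorted(set(pattern)):
            node = node.children.setdefault(feature, PCNode())
            node.count += 1
    return root


def count_nodes(node: PCNode) -> int:
    """Number of nodes below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


def enumerate_patterns(
    node: PCNode, prefix: Tuple[int, ...] = ()
) -> List[Tuple[Tuple[int, ...], int]]:
    """Recover every stored non-empty pattern together with its multiplicity."""
    found: List[Tuple[Tuple[int, ...], int]] = []
    for feature, child in sorted(node.children.items()):
        path = prefix + (feature,)
        # Patterns that end at this node are those not continuing to a child.
        ending_here = child.count - sum(g.count for g in child.children.values())
        if ending_here > 0:
            found.append((path, ending_here))
        found.extend(enumerate_patterns(child, path))
    return found


if __name__ == "__main__":
    data = [[1, 2, 3], [1, 2, 4], [1, 2, 3], [2, 5]]
    tree = build_pc_tree(data)
    print(count_nodes(tree))
    print(enumerate_patterns(tree))
```

In this sketch the four example patterns contain eleven feature occurrences but need only six nodes, because shared prefixes are stored once, and `enumerate_patterns` shows that the original patterns and their frequencies can still be read back from the tree. New patterns can be added later by inserting them into the same tree, which is one plausible reading of how such a structure supports dynamic databases.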

Item Type: Journal Article
Publication: Pattern Recognition Letters
Publisher: Elsevier Science
Additional Information: Copyright of this article belongs to Elsevier Science.
Keywords: PC-tree; Single database scan; Dynamic mining; Segment PC-tree; Rough PC-tree; Classification; Rough set
Department/Centre: Division of Electrical Sciences > Computer Science & Automation
Date Deposited: 25 Aug 2011 05:46
Last Modified: 25 Aug 2011 05:46
URI: http://eprints.iisc.ac.in/id/eprint/40295
