Achar, Avinash and Ibrahim, A and Sastry, PS (2013) Pattern-growth based frequent serial episode discovery. In: DATA & KNOWLEDGE ENGINEERING, 87. pp. 91-108.
Abstract
Frequent episode discovery is a popular framework for pattern discovery from sequential data. It has found many applications in domains such as alarm management in telecommunication networks, fault analysis in manufacturing plants, and prediction of user behavior in web click streams. In this paper, we address the discovery of serial episodes. In the episode context, there are multiple ways to quantify the frequency of an episode. Most current algorithms for episode discovery under the various frequencies are apriori-based level-wise methods, which essentially perform a breadth-first search of the pattern space. However, there are currently no depth-first methods of pattern discovery in the frequent episode framework under many of the frequency definitions. In this paper, we try to bridge this gap. We provide new depth-first algorithms for serial episode discovery under non-overlapped and total frequencies. Under non-overlapped frequency, we present algorithms that can handle span and gap constraints on episode occurrences. Under total frequency, we present an algorithm that can handle a span constraint. We provide proofs of correctness for the proposed algorithms and demonstrate their effectiveness through extensive simulations. We also give detailed run-time comparisons with the existing apriori-based methods and illustrate scenarios in which the proposed pattern-growth algorithms perform better than their apriori counterparts. (C) 2013 Elsevier B.V. All rights reserved.
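To make the notion of non-overlapped frequency concrete, the following is a minimal sketch of how the non-overlapped occurrences of one serial episode might be counted in an event sequence. It is not the pattern-growth algorithm of the paper; it illustrates only the frequency measure, using the commonly described greedy single-pass scan without span or gap constraints. The function name `non_overlapped_frequency` is hypothetical, and the input is assumed to be a list of event types already ordered by time (timestamps are omitted for simplicity).

```python
from typing import Hashable, Sequence


def non_overlapped_frequency(events: Sequence[Hashable],
                             episode: Sequence[Hashable]) -> int:
    """Count non-overlapped occurrences of a serial episode.

    Assumptions (not taken from the paper): `events` holds event types
    in time order, and no span or gap constraint is applied.
    """
    if not episode:
        raise ValueError("episode must contain at least one event type")

    count = 0  # completed non-overlapped occurrences so far
    pos = 0    # index of the next episode event type we are waiting for

    for event in events:
        if event == episode[pos]:
            pos += 1
            if pos == len(episode):
                # One full occurrence completed; start over so that the
                # next occurrence begins strictly after this one ends.
                count += 1
                pos = 0
    return count


if __name__ == "__main__":
    sequence = list("ABCABABC")
    print(non_overlapped_frequency(sequence, list("ABC")))
    print(non_overlapped_frequency(sequence, list("AB")))
```

Because the scan restarts only after an occurrence completes, each counted occurrence begins after the previous one has ended, which is what keeps the counted occurrences non-overlapped.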
| Item Type: | Journal Article |
|---|---|
| Publication: | DATA & KNOWLEDGE ENGINEERING |
| Publisher: | ELSEVIER SCIENCE BV |
| Additional Information: | Copyright for this article belongs to Elsevier Science |
| Keywords: | Mining methods and algorithms; Frequent episodes; Non-overlapped frequency; Span constraints; Gap constraints; Depth-first search |
| Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
| Date Deposited: | 16 Dec 2013 06:48 |
| Last Modified: | 16 Dec 2013 06:48 |
| URI: | http://eprints.iisc.ac.in/id/eprint/47901 |