ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Knowledge-infused and consistent Complex Event Processing over real-time and persistent streams

Zhou, Qunzhi and Simmhan, Yogesh and Prasanna, Viktor (2017) Knowledge-infused and consistent Complex Event Processing over real-time and persistent streams. In: FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 76 . pp. 391-406.

[img] PDF
Fut_Gen_Com_Sys_Int_Jou_Esc_76_391_2017.pdf - Published Version
Restricted to Registered users only

Download (2MB) | Request a copy
Official URL: http://doi.org/10.1016/j.future.2016.10.030

Abstract

Emerging applications in Internet of Things (IoT) and Cyber-Physical Systems (CPS) present novel challenges to Big Data platforms for performing online analytics. Ubiquitous sensors from IoT deployments are able to generate data streams at high velocity, that include information from a variety of domains, and accumulate to large volumes on disk. Complex Event Processing (CEP) is recognized as an important real-time computing paradigm for analyzing continuous data streams. However, existing work on CEP is largely limited to relational query processing, exposing two distinctive gaps for query specification and execution: (1) infusing the relational query model with higher level knowledge semantics, and (2) seamless query evaluation across temporal spaces that span past, present and future events. These allow accessible analytics over data streams having properties from different disciplines, and help span the velocity (real-time) and volume (persistent) dimensions. In this article, we introduce a Knowledge infused CEP (chi-CEP) framework that provides domain-aware knowledge query constructs along with temporal operators that allow end-to-end queries to span across real-time and persistent streams. We translate this query model to efficient query execution over online and offline data streams, proposing several optimizations to mitigate the overheads introduced by evaluating semantic predicates and in accessing high-volume historic data streams. In particular, we also address temporal consistency issues that arise during fault recovery of query plans that span the boundary between real-time and persistent streams. The proposed chi-CEP query model and execution approaches are implemented in our prototype semantic CEP engine, SCEPter. We validate our query model using domain-aware CEP queries from a real-world Smart Power Grid application, and experimentally analyze the benefits of our optimizations for executing these queries, using event streams from a campus-microgrid loT deployment. Our results show that we are able to sustain a processing throughput of 3, 000 events/secs for chi-CEP queries, a 30 x improvement over the baseline and sufficient to support a Smart Township, and can resume consistent processing within 20 secs after stream outages as long as 2 hours. (C) 2016 Elsevier B.V. All rights reserved.

Item Type: Journal Article
Publication: FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE
Additional Information: Copy right for this article belongs to the ELSEVIER SCIENCE BV, PO BOX 211, 1000 AE AMSTERDAM, NETHERLANDS
Department/Centre: Division of Interdisciplinary Sciences > Computational and Data Sciences
Date Deposited: 16 Sep 2017 06:30
Last Modified: 11 Oct 2018 12:53
URI: http://eprints.iisc.ac.in/id/eprint/57779

Actions (login required)

View Item View Item