ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds

Kumbhare, Alok Gautam and Simmhan, Yogesh and Prasanna, Viktor K (2014) PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds. In: 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), MAY 26-29, 2014, Chicago, IL, pp. 344-353.

[img] PDF
2014-14th_IEEE-ACM_Int_Sym_on_Clu_Clo_and_Gri_Com_344_2014.pdf - Published Version
Restricted to Registered users only

Download (383kB) | Request a copy
Official URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?...

Abstract

Scalable stream processing and continuous dataflow systems are gaining traction with the rise of big data due to the need for processing high velocity data in near real time. Unlike batch processing systems such as MapReduce and workflows, static scheduling strategies fall short for continuous dataflows due to the variations in the input data rates and the need for sustained throughput. The elastic resource provisioning of cloud infrastructure is valuable to meet the changing resource needs of such continuous applications. However, multi-tenant cloud resources introduce yet another dimension of performance variability that impacts the application's throughput. In this paper we propose PLAStiCC, an adaptive scheduling algorithm that balances resource cost and application throughput using a prediction-based lookahead approach. It not only addresses variations in the input data rates but also the underlying cloud infrastructure. In addition, we also propose several simpler static scheduling heuristics that operate in the absence of accurate performance prediction model. These static and adaptive heuristics are evaluated through extensive simulations using performance traces obtained from Amazon AWS IaaS public cloud. Our results show an improvement of up to 20% in the overall profit as compared to the reactive adaptation algorithm.

Item Type: Conference Proceedings
Additional Information: Copy right for this article belongs to the IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA
Keywords: Continuous Dataflows; Predictive scheduling; IaaS Clouds; Elastic resource management; Stream processing
Department/Centre: Division of Interdisciplinary Research > Supercomputer Education & Research Centre
Depositing User: Id for Latest eprints
Date Deposited: 09 Oct 2015 05:36
Last Modified: 09 Oct 2015 05:36
URI: http://eprints.iisc.ac.in/id/eprint/52523

Actions (login required)

View Item View Item