Karthik, Srinivas and Haritsa, Jayant R and Kenkre, Sreyash and Pandit, Vinayaka (2016) Platform-independent Robust Query Processing. In: 32nd IEEE International Conference on Data Engineering (ICDE), MAY 16-20, 2016, Helsinki, FINLAND, pp. 325-336.
Abstract
To address the classical selectivity estimation problem in databases, a radically different approach called PlanBouquet was recently proposed in [3], wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficial outcome of this new construction is that, for the first time, provable guarantees are obtained on worst-case performance, thereby facilitating robust query processing. The PlanBouquet formulation suffers, however, from a systemic drawback - the performance bound is a function of not only the query, but also the optimizer's behavioral profile over the underlying database platform. As a result, there are adverse consequences: (i) the bound value becomes highly variable, depending on the specifics of the current operating environment, and (ii) it becomes infeasible to compute the value without substantial investments in preprocessing overheads. In this paper, we present SpillBound, a new query processing algorithm that retains the core strength of the PlanBouquet discovery process, but reduces the bound dependency to only the query. Specifically, SpillBound delivers a worst-case multiplicative bound of D^2 + 3D, where D is simply the number of error-prone predicates in the user query. Consequently, the bound value becomes independent of the optimizer and the database platform, and the guarantee can be issued just by inspecting the query, without incurring any additional computational effort. We go on to prove that SpillBound is within an O(D) factor of the best possible deterministic selectivity discovery algorithm in its class. Further, a detailed empirical evaluation over the standard TPC-H and TPC-DS benchmarks indicates that SpillBound provides markedly superior worst-case performance as compared to PlanBouquet in practice. Therefore, in an overall sense, SpillBound offers a substantive step forward in the quest for robust query processing.
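As a small illustration of why the guarantee can be issued just by inspecting the query, the sketch below evaluates the SpillBound bound, D squared plus 3D, for a few values of D. The function name `spillbound_guarantee` and its argument name are hypothetical and chosen for this example only; the sketch merely evaluates the formula quoted in the abstract and does not implement the SpillBound algorithm itself.

```python
def spillbound_guarantee(num_error_prone_predicates: int) -> int:
    """Evaluate the worst-case multiplicative bound D^2 + 3D from the abstract.

    The only input is D, the number of error-prone predicates in the query;
    no optimizer or platform information is needed. (Hypothetical helper,
    not part of the paper.)
    """
    d = num_error_prone_predicates
    if d < 1:
        raise ValueError("D must be a positive integer")
    return d * d + 3 * d


if __name__ == "__main__":
    # Tabulate the bound for queries with 1 to 5 error-prone predicates.
    for d in range(1, 6):
        print(f"D = {d}: bound = {spillbound_guarantee(d)}")
```

For example, a query with D = 3 error-prone predicates has a bound of 18, and this value does not change when the database platform changes.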
Item Type: | Conference Proceedings |
---|---|
Series: | IEEE International Conference on Data Engineering |
Additional Information: | Copyright for this article belongs to the IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA |
Department/Centre: | Division of Interdisciplinary Sciences > Computational and Data Sciences; Division of Interdisciplinary Sciences > Supercomputer Education & Research Centre |
Date Deposited: | 22 Oct 2016 09:30 |
Last Modified: | 26 Nov 2018 15:17 |
URI: | http://eprints.iisc.ac.in/id/eprint/55039 |