copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Smooth Scan: Statistics-oblivious access paths

R. Borovica-Gajic, S. Idreos, A. Ailamaki, M. Zukowski, and C. Fraser. Data Engineering (ICDE), 2015 IEEE 31st International Conference on, page 315--326. (April 2015)

Abstract

Query optimizers depend heavily on statistics representing column distributions to create efficient query plans. In many cases, though, statistics are outdated or non-existent, and the process of refreshing statistics is very expensive, especially for ad-hoc workloads on ever bigger data. This results in suboptimal plans that severely hurt performance. The main problem is that any decision, once made by the optimizer, is fixed throughout the execution of a query. In particular, each logical operator translates into a fixed choice of a physical operator at run-time. In this paper, we advocate for continuous adaptation and morphing of physical operators throughout their lifetime, by adjusting their behavior in accordance with the statistical properties of the data. We demonstrate the benefits of the new paradigm by designing and implementing an adaptive access path operator called Smooth Scan, which morphs continuously within the space of traditional index access and full table scan. Smooth Scan behaves similarly to an index scan for low selectivity; if selectivity increases, however, Smooth Scan progressively morphs its behavior toward a sequential scan. As a result, a system with Smooth Scan requires no access path decisions up front nor does it need accurate statistics to provide good performance. We implement Smooth Scan in PostgreSQL and, using both synthetic benchmarks as well as TPC-H, we show that it achieves robust performance while at the same time being statistics-oblivious.

Links and resources

BibTeX key: Borovica-Gajic2015-mh
entry type: inproceedings
booktitle: Data Engineering (ICDE), 2015 IEEE 31st International Conference on
year: 2015
month: apr
pages: 315--326

@christophv's tags highlighted

Cite this publication

%0 Conference Paper %1 Borovica-Gajic2015-mh %A Borovica-Gajic, Renata %A Idreos, Stratos %A Ailamaki, Anastasia %A Zukowski, Marcin %A Fraser, Campbell %B Data Engineering (ICDE), 2015 IEEE 31st International Conference on %D 2015 %K Complexity_theory Estimation Expose Indexes Probes Query_processing Robustness Switches %P 315--326 %T Smooth Scan: Statistics-oblivious access paths %X Query optimizers depend heavily on statistics representing column distributions to create efficient query plans. In many cases, though, statistics are outdated or non-existent, and the process of refreshing statistics is very expensive, especially for ad-hoc workloads on ever bigger data. This results in suboptimal plans that severely hurt performance. The main problem is that any decision, once made by the optimizer, is fixed throughout the execution of a query. In particular, each logical operator translates into a fixed choice of a physical operator at run-time. In this paper, we advocate for continuous adaptation and morphing of physical operators throughout their lifetime, by adjusting their behavior in accordance with the statistical properties of the data. We demonstrate the benefits of the new paradigm by designing and implementing an adaptive access path operator called Smooth Scan, which morphs continuously within the space of traditional index access and full table scan. Smooth Scan behaves similarly to an index scan for low selectivity; if selectivity increases, however, Smooth Scan progressively morphs its behavior toward a sequential scan. As a result, a system with Smooth Scan requires no access path decisions up front nor does it need accurate statistics to provide good performance. We implement Smooth Scan in PostgreSQL and, using both synthetic benchmarks as well as TPC-H, we show that it achieves robust performance while at the same time being statistics-oblivious.

@inproceedings{Borovica-Gajic2015-mh, abstract = {Query optimizers depend heavily on statistics representing column distributions to create efficient query plans. In many cases, though, statistics are outdated or non-existent, and the process of refreshing statistics is very expensive, especially for ad-hoc workloads on ever bigger data. This results in suboptimal plans that severely hurt performance. The main problem is that any decision, once made by the optimizer, is fixed throughout the execution of a query. In particular, each logical operator translates into a fixed choice of a physical operator at run-time. In this paper, we advocate for continuous adaptation and morphing of physical operators throughout their lifetime, by adjusting their behavior in accordance with the statistical properties of the data. We demonstrate the benefits of the new paradigm by designing and implementing an adaptive access path operator called Smooth Scan, which morphs continuously within the space of traditional index access and full table scan. Smooth Scan behaves similarly to an index scan for low selectivity; if selectivity increases, however, Smooth Scan progressively morphs its behavior toward a sequential scan. As a result, a system with Smooth Scan requires no access path decisions up front nor does it need accurate statistics to provide good performance. We implement Smooth Scan in PostgreSQL and, using both synthetic benchmarks as well as TPC-H, we show that it achieves robust performance while at the same time being statistics-oblivious.}, added-at = {2015-06-12T13:57:33.000+0200}, author = {Borovica-Gajic, Renata and Idreos, Stratos and Ailamaki, Anastasia and Zukowski, Marcin and Fraser, Campbell}, biburl = {https://www.bibsonomy.org/bibtex/27e7d1f719fc9524e220f3688b5fdbb02/christophv}, booktitle = {Data Engineering ({ICDE)}, 2015 {IEEE} 31st International Conference on}, interhash = {d279bfc91d7b481702291ed7b2fe668f}, intrahash = {7e7d1f719fc9524e220f3688b5fdbb02}, keywords = {Complexity_theory Estimation Expose Indexes Probes Query_processing Robustness Switches}, month = apr, pages = {315--326}, timestamp = {2016-01-04T14:22:08.000+0100}, title = {Smooth Scan: Statistics-oblivious access paths}, year = 2015 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Smooth Scan: Statistics-oblivious access paths

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Smooth Scan: Statistics-oblivious access paths

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Smooth Scan: Statistics-oblivious access paths

Comments and Reviews
(0)