We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Data-induced predicates for sideways information passing in query optimizers.
- Authors
Kandula, Srikanth; Orr, Laurel; Chaudhuri, Surajit
- Abstract
Using data statistics, we convert predicates on a table into data-induced predicates (diPs) that apply on the joining tables. Doing so substantially speeds up multi-relation queries because the benefits of predicate pushdown can now apply beyond just the tables that have predicates. We use diPs to skip data exclusively during query optimization; i.e., diPs lead to better plans and have no overhead during query execution. We study how to apply diPs for complex query expressions and how the usefulness of diPs varies with the data statistics used to construct diPs and the data distributions. Our results show that building diPs using zone-maps which are already maintained in today's clusters leads to sizable data skipping gains. Using a new (slightly larger) statistic, 50% of the queries in the TPC-H, TPC-DS and JoinOrder benchmarks can skip at least 33% of the query input. Consequently, the median query in a production big-data cluster finishes roughly 2 × faster.
- Subjects
DATA distribution
- Publication
VLDB Journal International Journal on Very Large Data Bases, 2022, Vol 31, Issue 6, p1263
- ISSN
1066-8888
- Publication type
Article
- DOI
10.1007/s00778-021-00693-2