Many-query join: efficient shared execution of relational joins on modern hardware
OPEN ACCESS
Date
2018-10
Publication Type
Journal Article
ETH Bibliography
yes
Abstract
Database architectures typically process queries one at a time, executing concurrent queries in independent execution contexts. Often, such a design leads to unpredictable performance and poor scalability. One approach to circumvent the problem is to take advantage of sharing opportunities across concurrently running queries. In this paper, we propose many-query join (MQJoin), a novel method for sharing the execution of a join that can efficiently deal with hundreds of concurrent queries. This is achieved by minimizing redundant work and making efficient use of main-memory bandwidth and multi-core architectures. Compared to existing proposals, MQJoin is able to efficiently handle larger workloads regardless of the schema by exploiting more sharing opportunities. We also compared MQJoin to two commercial main-memory column-store databases. For a TPC-H-based workload, we show that MQJoin provides 2–5× higher throughput with significantly more stable response times.
Publication status
published
Volume
27 (5)
Pages / Article No.
669–692
Publisher
Springer
Subject
RDBMS; OLAP; Analytics; Join; MQJoin; Shared join; Main Memory; TPC-H; Xeon Phi; MCDRAM
Organisational unit
03506 - Alonso, Gustavo / Alonso, Gustavo
Notes
It was possible to publish this article open access thanks to a Swiss National Licence with the publisher.