Many-query join: efficient shared execution of relational joins on modern hardware


Date

2018-10

Publication Type

Journal Article

ETH Bibliography

yes

Abstract

Database architectures typically process queries one at a time, executing concurrent queries in independent execution contexts. Often, such a design leads to unpredictable performance and poor scalability. One approach to circumvent the problem is to take advantage of sharing opportunities across concurrently running queries. In this paper, we propose many-query join (MQJoin), a novel method for sharing the execution of a join that can efficiently deal with hundreds of concurrent queries. This is achieved by minimizing redundant work and making efficient use of main-memory bandwidth and multi-core architectures. Compared to existing proposals, MQJoin is able to efficiently handle larger workloads regardless of the schema by exploiting more sharing opportunities. We also compared MQJoin to two commercial main-memory column-store databases. For a TPC-H-based workload, we show that MQJoin provides 2–5× higher throughput with significantly more stable response times.

Publication status

published

Volume

27 (5)

Pages / Article No.

669 - 692

Publisher

Springer

Subject

RDBMS; OLAP; Analytics; Join; MQJoin; Shared join; Main Memory; TPC-H; Xeon Phi; MCDRAM

Organisational unit

03506 - Alonso, Gustavo / Alonso, Gustavo

Notes

It was possible to publish this article open access thanks to a Swiss National Licence with the publisher.
