Data processing systems face the challenge of supporting increasingly diverse workloads efficiently. At the same time, they are already bloated with internal complexity, and it is not clear how new hardware can be supported sustainably. In this paper, we aim to resolve these issues by proposing a unified abstraction layer based on declarative sub-operators in addition to relational operators. By exposing this layer to users, they can express their non-relational workloads declaratively with sub-operators. Furthermore, the proposed sub-operators decouple the semantic implementation of operators from the efficient imperative implementation, reducing the implementation complexity for relational operators. Finally, through fine-grained automatic optimizations, the declarative sub-operators allow for automatic morsel-driven parallelism. We demonstrate the benefits not only by providing a specific set of sub-operators but also implementing them in a compiling query engine. With thorough evaluation and analysis, we show that we can support a richer set of workloads while retaining the development complexity low and being competitive in performance even with specialized systems.
机构:
Aix Marseille Univ, Inst Math, UMR 7373, 39 Rue F Joliot Curie, F-13453 Marseille 13, FranceAix Marseille Univ, Inst Math, UMR 7373, 39 Rue F Joliot Curie, F-13453 Marseille 13, France
Charpentier, S.
Mouze, A.
论文数: 0引用数: 0
h-index: 0
机构:
Lab Paul Painleve, UMR 8524, Cite Sci, F-59650 Villeneuve Dascq, France
Ecole Cent Lille, Cite Sci,CS20048, F-59651 Villeneuve Dascq, FranceAix Marseille Univ, Inst Math, UMR 7373, 39 Rue F Joliot Curie, F-13453 Marseille 13, France