PMJoin: Optimizing distributed multi-way stream joins by stream partitioning

被引:0
|
作者
Zhou, Yongluan [1 ]
Yan, Ying
Yu, Feng
Zhou, Aoying
机构
[1] Natl Univ Singapore, Singapore 117548, Singapore
[2] Fudan Univ, Shanghai, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur large communication cost. As queries run continuously, the precious bandwidths would be aggressively consumed without careful optimization of operator ordering and placement. In this paper, we focus on the optimization of continuous multi-join queries over distributed streams. We observe that by partitioning streams into substreams we can significantly reduce the communication cost and hence propose a novel partition-based join scheme - PMJoin. A few partitioning techniques are studied. To generate the query plan for each substream, a heuristic algorithm is proposed based on a rate-based model. Results from an extensive experimental study show that our techniques can sufficiently reduce the communication cost.
引用
收藏
页码:325 / 341
页数:17
相关论文
共 50 条
  • [41] Multi-way R-tree joins using indirect predicates
    Park, HH
    Min, JK
    Chung, CW
    Chang, TG
    INFORMATION AND SOFTWARE TECHNOLOGY, 2004, 46 (11) : 739 - 751
  • [42] Adaptive Optimisation For Continuous Multi-Way Joins Over RDF Streams
    Danh Le-Phuoc
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1857 - 1865
  • [43] MML inference of decision graphs with multi-way joins and dynamic attributes
    Tan, PJ
    Dowe, DL
    AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 269 - 281
  • [44] Multi-Way Windowed Streams theta-Joins Using Cluster
    Liu, Xinchun
    Li, Jing
    Fan, Xiaopeng
    Chen, Jun
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (02): : 103 - 120
  • [45] Evaluating Negation with Multi-way Joins Accelerates Class Expression Learning
    Karalis, Nikolaos
    Bigerl, Alexander
    Demir, Caglar
    Heidrich, Liss
    Ngomo, Axel-Cyrille Ngonga
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK, PT VI, ECML PKDD 2024, 2024, 14946 : 199 - 216
  • [46] COLA: Optimizing Stream Processing Applications via Graph Partitioning
    Khandekar, Rohit
    Hildrum, Kirsten
    Parekh, Sujay
    Rajan, Deepak
    Wolf, Joel
    Wu, Kun-Lung
    Andrade, Henrique
    Gedik, Bugra
    MIDDLEWARE 2009, PROCEEDINGS, 2009, 5896 : 308 - 327
  • [47] Duality-Based Locality-Aware Stream Partitioning in Distributed Stream Processing Engines
    Son, Siwoon
    Moon, Yang-Sae
    EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 725 - 730
  • [48] Iterative improvement based multi-way netlist partitioning for FPGAs
    Krupnova, H
    Saucier, G
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION 1999, PROCEEDINGS, 1999, : 587 - 594
  • [49] Multi-way FPGA partitioning by fully exploiting design hierarchy
    Fang, WJ
    Wu, ACH
    DESIGN AUTOMATION CONFERENCE - PROCEEDINGS 1997, 1997, : 518 - 521
  • [50] An efficient multi-way algorithm for balanced partitioning of VLSI circuits
    Tan, X
    Tong, J
    Tan, P
    Park, N
    Lombardi, F
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1997, : 608 - 613