PMJoin: Optimizing distributed multi-way stream joins by stream partitioning

Cited by: 0
Authors
Zhou, Yongluan [1]
Yan, Ying
Yu, Feng
Zhou, Aoying
Affiliations
[1] Natl Univ Singapore, Singapore 117548, Singapore
[2] Fudan Univ, Shanghai, Peoples R China
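Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract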
In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur a large communication cost. Because queries run continuously, scarce bandwidth is quickly consumed unless operator ordering and placement are carefully optimized. In this paper, we focus on optimizing continuous multi-join queries over distributed streams. We observe that partitioning streams into substreams can significantly reduce the communication cost, and hence propose a novel partition-based join scheme, PMJoin. Several partitioning techniques are studied. To generate the query plan for each substream, we propose a heuristic algorithm based on a rate-based model. Results from an extensive experimental study show that our techniques can substantially reduce the communication cost.
Pages: 325 - 341
Page count: 17
Related papers
50 in total
  • [21] Faster joins, self-joins and multi-way joins using join indices
    Lei, Hui
    Ross, Kenneth A.
    Data and Knowledge Engineering, 1999, 29 (02) : 179 - 200
  • [22] An Evaluation of Multi-way Joins for Relational Database Systems
    Henderson, Michael
    Lawrence, Ramon
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2013, 2014, 190 : 37 - 50
  • [23] Considering Data Skew in Multi-way Joins for MapReduce
    Wu, Lei
    Zhang, Changchun
    Meng, Haiyan
    Li, Jing
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013 : 69 - 73
  • [24] Residual Sensitivity for Differentially Private Multi-Way Joins
    Dong, Wei
    Yi, Ke
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021 : 432 - 444
  • [25] MML inference of decision graphs with multi-way joins
    Tan, PJ
    Dowe, DL
    AI 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 131 - 142
  • [26] Graph partition based multi-way spatial joins
    Lin, XM
    Lu, HX
    Zhang, Q
    IDEAS 2002: INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2002 : 23 - 32
  • [27] On estimating result sizes of multi-way spatial joins
    Park, HH
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 3, PROCEEDINGS, 2003, 2669 : 856 - 865
  • [28] A distributed multilevel ant-colony algorithm for the multi-way graph partitioning
    Tashkova, K.
    Korosec, P.
    Silc, J.
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2011, 3 (05) : 286 - 296
  • [29] Load Distribution for Multi-Way Streams Joins Using Cluster
    Liu, Xinchun
    Li, Jing
    Fan, Xiaopeng
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2014
  • [30] Adaptive key partitioning in distributed stream processing
    Gang Liu
    Zeting Wang
    Amelie Chi Zhou
    Rui Mao
    CCF Transactions on High Performance Computing, 2024, 6 : 164 - 178