PMJoin: Optimizing distributed multi-way stream joins by stream partitioning

Citations: 0
Authors
Zhou, Yongluan [1]
Yan, Ying
Yu, Feng
Zhou, Aoying
Affiliations
[1] Natl Univ Singapore, Singapore 117548, Singapore
[2] Fudan Univ, Shanghai, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur a large communication cost. Because queries run continuously, precious bandwidth would be aggressively consumed without careful optimization of operator ordering and placement. In this paper, we focus on the optimization of continuous multi-join queries over distributed streams. We observe that by partitioning streams into substreams we can significantly reduce the communication cost, and hence propose a novel partition-based join scheme, PMJoin. A few partitioning techniques are studied. To generate the query plan for each substream, a heuristic algorithm is proposed based on a rate-based model. Results from an extensive experimental study show that our techniques can sufficiently reduce the communication cost.
Pages: 325 - 341
Page count: 17
Related papers
50 in total
  • [1] Optimizing Multiple Multi-Way Stream Joins
    Dossinger, Manuel
    Michel, Sebastian
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1985 - 1990
  • [2] Query processing of multi-way stream window joins
    Moustafa A. Hammad
    Walid G. Aref
    Ahmed K. Elmagarmid
    The VLDB Journal, 2008, 17 : 469 - 488
  • [3] Query processing of multi-way stream window joins
    Hammad, Moustafa A.
    Aref, Walid G.
    Elmagarmid, Ahmed K.
    VLDB JOURNAL, 2008, 17 (03): : 469 - 488
  • [4] A Scalable Circular Pipeline Design for Multi-Way Stream Joins in Hardware
    Najafi, Mohammadreza
    Sadoghi, Mohammad
    Jacobsen, Hans-Arno
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1280 - 1283
  • [5] Load shedding for multi-way stream joins based on arrival order patterns
    Kwon, Tae-Hyung
    Lee, Ki Yong
    Kim, Myoung Ho
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2011, 37 (02) : 245 - 265
  • [6] Scaling Out Multi-Way Stream Joins using Optimized, Iterative Probing
    Dossinger, Manuel
    Michel, Sebastian
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 449 - 456
  • [7] Load shedding for multi-way stream joins based on arrival order patterns
    Tae-Hyung Kwon
    Ki Yong Lee
    Myoung Ho Kim
    Journal of Intelligent Information Systems, 2011, 37 : 245 - 265
  • [8] Optimizing Multi-Way Spatial Joins of Web Feature Services
    Lan, Guiwen
    Zhang, Qiang
    Yang, Zhao
    Li, Tong
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (04)
  • [9] CLASH: A High-Level Abstraction for Optimized, Multi-Way Stream Joins over Apache Storm
    Dossinger, Manuel
    Michel, Sebastian
    Roudsarabi, Constantin
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1897 - 1900
  • [10] Optimizing Multi-way Theta Join for Data Skew in Sub-second Stream Computing
    Fan, Xiaopeng
    Liu, Xinchun
    Wang, Yang
    Wang, Youjun
    Li, Jing
    2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 476 - 485