Load shedding for window joins on multiple data streams

被引:4
|
作者
Law, Yan-Nei [1 ]
Zaniolo, Carlo [2 ]
机构
[1] Bioinformat Inst, 30 Biopolis St, Singapore 138671, Singapore
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
关键词
D O I
10.1109/ICDEW.2007.4401054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of semantic load shedding for continuous queries containing window joins on multiple data streams and propose a robust approach that is effective with the different semantic accuracy criteria that are required in different applications. In fact, our approach can be used to (i) maximize the number of output tuples produced by joins, and (ii) optimize the accuracy of complex aggregates estimates under uniform random sampling. We first consider the problem of computing maximal subsets of approximate window joins over multiple data streams. Previously proposed approaches are based on multiple pair-wise joins and, in their load-shedding decisions, disregard the content of streams outside the joined pairs. To overcome these limitations, we optimize our load-shedding policy using various predictors of the productivity of each tuple in the window. To minimize processing costs, we use a fast and-light sketching technique to estimate the productivity of the tuples. We then show that our method can be generalized to produce statistically accurate samples, as needed in, e.g., the computation of averages, quantiles, and stream mining queries. Tests performed on both synthetic and real-life data demonstrate that our method outperforms previous approaches, while requiring comparable amounts of time and space.
引用
收藏
页码:674 / +
页数:2
相关论文
共 50 条
  • [1] Load shedding for window joins over streams
    Han, Donghong
    Xiao, Chuan
    Zhou, Rui
    Wang, Guoren
    Huo, Huan
    Hui, Xiaoyun
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 472 - 483
  • [2] Load shedding for window joins over streams
    Han, Dong-Hong
    Wang, Guo-Ren
    Xiao, Chuan
    Zhou, Rui
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (02) : 182 - 189
  • [3] Load Shedding for Window Joins over Streams
    Dong-Hong Han
    Guo-Ren Wang
    Chuan Xiao
    Rui Zhou
    [J]. Journal of Computer Science and Technology, 2007, 22 : 182 - 189
  • [4] Hardware processor for window joins over multiple data streams
    School of Information Science and Engineering, Ningbo University, Ningbo 315211, China
    不详
    [J]. Tien Tzu Hsueh Pao, 2009, 2 (404-409): : 404 - 409
  • [5] FPGA acceleration window joins over multiple data streams
    Qian, JB
    Xu, HB
    Dong, YS
    Liu, XJ
    Wang, YL
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2005, 14 (04) : 813 - 830
  • [6] Load shedding for window queries over continuous data streams
    Kim, Kwang Rak
    Kim, Hyeon Gyu
    [J]. Lecture Notes in Electrical Engineering, 2015, 373 : 159 - 164
  • [7] Efficient load shedding for streaming sliding window joins
    Ren, Jia-Dong
    Jiang, Wan-Chang
    Huo, Cong
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1536 - 1541
  • [8] Index-based load shedding for streaming sliding window joins
    Ren, Jiadong
    Jiang, Wanchang
    Huo, Cong
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 162 - +
  • [9] Load Shedding for Shared Window Join over Real-Time Data Streams
    Ma, Li
    Liang, Dangwei
    Zhang, Qiongsheng
    Li, Xin
    Wang, Hongan
    [J]. ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2009, 5446 : 590 - +
  • [10] Adaptive scheduling for shared window joins over data streams
    Jin C.
    Zhou A.
    Yu J.X.
    Huang J.Z.
    Cao F.
    [J]. Frontiers of Computer Science in China, 2007, 1 (4): : 468 - 477