Monitoring Distributed Streams using Convex Decompositions

被引:15
|
作者
Lazerson, Arnon [1 ]
Sharfman, Izchak [1 ]
Keren, Daniel [2 ]
Schuster, Assaf [1 ]
Garofalakis, Minos [3 ]
Samoladas, Vasilis [3 ]
机构
[1] Israeli Inst Technol, Fac Comp Sci, Tel Aviv, Israel
[2] Univ Haifa, Dept Comp Sci, IL-31999 Haifa, Israel
[3] Tech Univ Crete, Sch Elect & Comp Engn, Iraklion, Greece
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015年 / 8卷 / 05期
关键词
D O I
10.14778/2735479.2735487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emerging large-scale monitoring applications rely on continuous tracking of complex data-analysis queries over collections of massive, physically-distributed data streams. Thus, in addition to the space-and time-efficiency requirements of conventional stream-processing (at each remote monitor site), effective solutions also need to guarantee communication efficiency (over the underlying communication network). The complexity of the monitored query adds to the difficulty of the problem - this is especially true for nonlinear queries (e.g., joins), where no obvious solutions exist for distributing the monitored condition across sites. The recently proposed geometric method, based on the notion of covering spheres, offers a generic methodology for splitting an arbitrary (non-linear) global condition into a collection of local site constraints, and has been applied tomassive distributed stream-monitoring tasks, achieving state-of-the-art performance. In this paper, we present a far more general geometric approach, based on the convex decomposition of an appropriate subset of the domain of the monitoring query, and formally prove that it is always guaranteed to perform at least as good as the covering spheres method. We analyze our approach and demonstrate its effectiveness for the important case of sketch-based approximate tracking for norm, range-aggregate, and join-aggregate queries, which have numerous applications in streaming data analysis. Experimental results on real-life data streams verify the superiority of our approach in practical settings, showing that it substantially outperforms the covering spheres method.
引用
收藏
页码:545 / 556
页数:12
相关论文
共 50 条
  • [21] Sampling from Convex Sets with a Cold Start using Multiscale Decompositions
    Narayanan, Hariharan
    Rajaraman, Amit
    Srivastava, Piyush
    PROCEEDINGS OF THE 55TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2023, 2023, : 117 - 130
  • [22] SCHAUDER DECOMPOSITIONS IN LOCALLY CONVEX SPACES
    KALTON, NJ
    PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY-MATHEMATICAL AND PHYSICAL SCIENCES, 1970, 68 : 377 - &
  • [23] OPEN DECOMPOSITIONS ON ORDERED CONVEX SPACES
    WONG, YC
    PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY, 1973, 74 (JUL): : 49 - 59
  • [24] Locally convex spaces with Toeplitz decompositions
    Paúl, PJ
    Sáez, C
    Virués, JM
    JOURNAL OF THE AUSTRALIAN MATHEMATICAL SOCIETY SERIES A-PURE MATHEMATICS AND STATISTICS, 2000, 68 : 19 - 40
  • [25] CONVEX DECOMPOSITIONS AND THE VALENCE OF SOME FUNCTIONS
    Pintea, Cornel
    Tofan, Adrian
    JOURNAL OF NONLINEAR AND VARIATIONAL ANALYSIS, 2020, 4 (02): : 225 - 239
  • [26] On convex decompositions of a planar point set
    Hosono, Kiyoshi
    DISCRETE MATHEMATICS, 2009, 309 (06) : 1714 - 1717
  • [28] Distributed Estimation using Square Root Decompositions of Dependent Information
    Radtke, Susanne
    Noack, Benjamin
    Hanebeck, Uwe D.
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [29] Visualization of cracks by using the local Voronoi decompositions and distributed software
    Pacevic, Ruslan
    Kaceniauskas, Arnas
    Markauskas, Darius
    ADVANCES IN ENGINEERING SOFTWARE, 2015, 84 : 85 - 94
  • [30] Online Top-k-Position Monitoring of Distributed Data Streams
    Maecker, Alexander
    Malatyali, Manuel
    Heide, Friedhelm Meyer Auf Der
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 357 - 364