Sketching asynchronous data streams over sliding windows

被引:0
|
作者
Bojian Xu
Srikanta Tirthapura
Costas Busch
机构
[1] Iowa State University,Department of Electrical and Computer Engineering
[2] Louisiana State University,Department of Computer Science
来源
Distributed Computing | 2008年 / 20卷
关键词
Data streams; Asynchronous streams; Distributed streams; Sliding window; Sum; Median;
D O I
暂无
中图分类号
学科分类号
摘要
We study the problem of maintaining a sketch of recent elements of a data stream. Motivated by applications involving network data, we consider streams that are asynchronous, in which the observed order of data is not the same as the time order in which the data was generated. The notion of recent elements of a stream is modeled by the sliding timestamp window, which is the set of elements with timestamps that are close to the current time. We design algorithms for maintaining sketches of all elements within the sliding timestamp window that can give provably accurate estimates of two basic aggregates, the sum and the median, of a stream of numbers. The space taken by the sketches, the time needed for querying the sketch, and the time for inserting new elements into the sketch are all polylogarithmic with respect to the maximum window size. Our sketches can be easily combined in a lossless and compact way, making them useful for distributed computations over data streams. Previous works on sketching recent elements of a data stream have all considered the more restrictive scenario of synchronous streams, where the observed order of data is the same as the time order in which the data was generated. Our notion of recency of elements is more general than that studied in previous work, and thus our sketches are more robust to network delays and asynchrony.
引用
收藏
页码:359 / 374
页数:15
相关论文
共 50 条
  • [1] Sketching asynchronous data streams over sliding windows
    Xu, Bojian
    Tirthapura, Srikanta
    Busch, Costas
    [J]. DISTRIBUTED COMPUTING, 2008, 20 (05) : 359 - 374
  • [2] Sliding windows over uncertain data streams
    Dallachiesa, Michele
    Jacques-Silva, Gabriela
    Gedik, Bugra
    Wu, Kun-Lung
    Palpanas, Themis
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (01) : 159 - 190
  • [3] Sliding windows over uncertain data streams
    Michele Dallachiesa
    Gabriela Jacques-Silva
    Buğra Gedik
    Kun-Lung Wu
    Themis Palpanas
    [J]. Knowledge and Information Systems, 2015, 45 : 159 - 190
  • [4] Matrix Sketching Over Sliding Windows
    Wei, Zhewei
    Liu, Xuancheng
    Li, Feifei
    Shang, Shuo
    Du, Xiaoyong
    Wen, Ji-Rong
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1465 - 1480
  • [5] Clustering Data Streams over Sliding Windows by DCA
    Ta Minh Thuy
    Le Thi Hoai An
    Boudjeloud-Assala, Lydia
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 65 - 75
  • [6] Dynamic adjustment of sliding windows over data streams
    Zhang, DD
    Li, JZ
    Zhang, ZG
    Wang, WP
    Guo, LJ
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT: PROCEEDINGS, 2004, 3129 : 24 - 33
  • [7] On indexing sliding windows over online data streams
    Golab, L
    Garg, S
    Özsu, MT
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 712 - 729
  • [8] Efficiently Summarizing Data Streams over Sliding Windows
    Rivetti, Nicolo
    Busnel, Yann
    Mostefaoui, Achour
    [J]. 2015 IEEE 14th International Symposium on Network Computing and Applications (NCA), 2015, : 151 - 158
  • [9] Optimal Matrix Sketching over Sliding Windows
    Yin, Hanyan
    Wen, Dongxie
    Li, Jiajun
    Wei, Zhewei
    Zhang, Xiao
    Huang, Zengfeng
    Li, Feifei
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (09): : 2149 - 2161
  • [10] Querying sliding windows over on-line data streams
    Golab, Lukasz
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3268 : 1 - 11