Random sampling algorithms for sliding windows over data streams

被引:3
|
作者
Zhang, LB [1 ]
Li, ZH [1 ]
Yu, M [1 ]
Wang, Y [1 ]
Jiang, Y [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shannxi, Peoples R China
关键词
data streams; random sampling; sliding window; approximate algorithm;
D O I
10.1142/9789812701534_0129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are growing interests in algorithms over data streams recently. This paper introduces the problem of sampling from sliding windows of recent data items from data streams and presents two random sampling algorithms for this problem. The first algorithm is a basic window-based sampling algorithm (BWRS Algorithm) for count-based sliding window. BWRS algorithm extends classic reservoir sampling to deal with the expiration of data elements from count-based sliding window, and can avoid drawbacks of classic reservoir sampling. The second algorithm is a stratified multistage sampling algorithm for time-based sliding window (SMS Algorithm). The SMS algorithm takes different sampling fraction in different strata from time-based sliding window, and works even when the number of data items in the sliding window varies dynamically over time. The theoretic analysis and experiments show that the algorithms are effective and efficient for continuous data streams processing.
引用
收藏
页码:572 / 575
页数:4
相关论文
共 50 条
  • [1] Random sampling algorithms for landmark windows over data streams
    Zhang Longbo
    Li Zhanhuai
    Yu Min
    Wang Yong
    Jiang Yun
    [J]. ICEIS 2006: PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATIONAL SYSTEMS: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2006, : 103 - +
  • [2] A performance evaluation of data streams sampling algorithms over a sliding window
    El Sibai, Rayane
    Chabchoub, Yousra
    Demerjian, Jacques
    Chiky, Raja
    Barbar, Kablan
    [J]. 2018 IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM), 2018, : 211 - 216
  • [3] Sliding windows over uncertain data streams
    Dallachiesa, Michele
    Jacques-Silva, Gabriela
    Gedik, Bugra
    Wu, Kun-Lung
    Palpanas, Themis
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (01) : 159 - 190
  • [4] Sliding windows over uncertain data streams
    Michele Dallachiesa
    Gabriela Jacques-Silva
    Buğra Gedik
    Kun-Lung Wu
    Themis Palpanas
    [J]. Knowledge and Information Systems, 2015, 45 : 159 - 190
  • [5] New sampling-based summary structures for sliding windows over data streams
    Zhang, Longbo
    Li, Zhanhuai
    Yu, Min
    Zhao, Guangyuan
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 1242 - +
  • [6] Sketching asynchronous data streams over sliding windows
    Bojian Xu
    Srikanta Tirthapura
    Costas Busch
    [J]. Distributed Computing, 2008, 20 : 359 - 374
  • [7] On indexing sliding windows over online data streams
    Golab, L
    Garg, S
    Özsu, MT
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 712 - 729
  • [8] Efficiently Summarizing Data Streams over Sliding Windows
    Rivetti, Nicolo
    Busnel, Yann
    Mostefaoui, Achour
    [J]. 2015 IEEE 14th International Symposium on Network Computing and Applications (NCA), 2015, : 151 - 158
  • [9] Sketching asynchronous data streams over sliding windows
    Xu, Bojian
    Tirthapura, Srikanta
    Busch, Costas
    [J]. DISTRIBUTED COMPUTING, 2008, 20 (05) : 359 - 374
  • [10] Dynamic adjustment of sliding windows over data streams
    Zhang, DD
    Li, JZ
    Zhang, ZG
    Wang, WP
    Guo, LJ
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT: PROCEEDINGS, 2004, 3129 : 24 - 33