Sliding-Window Top-k Queries on Uncertain Streams

被引:47
|
作者
Jin, Cheqing [1 ]
Yi, Ke [2 ]
Chen, Lei [2 ]
Yu, Jeffrey Xu [3 ]
Lin, Xuemin [4 ,5 ]
机构
[1] East China Univ Sci & Technol, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[4] Univ New South Wales, Sydney, NSW, Australia
[5] Nat ICT Australia, Sydney, NSW, Australia
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2008年 / 1卷 / 01期
关键词
Data handling - Information retrieval - Query processing;
D O I
10.14778/1453856.1453892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query processing on uncertain data streams has attracted a lot of attentions lately, due to the imprecise nature in the data generated from a variety of streaming applications, such as readings from a sensor network. However, all of the existing works on uncertain data streams study unbounded streams. This paper takes the first step towards the important and challenging problem of answering sliding-window queries on uncertain data streams, with a focus on arguably one of the most important types of queries top-k queries. The challenge of answering sliding-window top-k queries on uncertain data streams stems from the strict space and time requirements of processing both arriving and expiring tuples in high-speed streams, combined with the difficulty of coping with the exponential blowup in the number of possible worlds induced by the uncertain data model. In this paper, we design a unified framework for processing sliding-window top-k queries on uncertain streams. We show that all the existing top-k definitions in the literature can be plugged into our framework, resulting in several succinct synopses that use space much smaller than the window size, while are also highly efficient in terms of processing time. In addition to the theoretical space and time bounds that we prove for these synopses, we also present a thorough experimental report to verify their practical efficiency on both synthetic and real data.
引用
收藏
页码:301 / 312
页数:12
相关论文
共 50 条
  • [31] Finding Top-k Most Frequent Items in Distributed Streams in the Time-Sliding Window Model
    Anceaume, Emmanuelle
    Busnel, Yann
    Cazacu, Vasile
    [J]. 2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS (DSN-W), 2018, : 61 - 62
  • [32] Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams
    Li, Lingli
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 283 - 287
  • [33] Mining top-k frequent closed itemsets over data streams using the sliding window model
    Tsai, Pauray S. M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 6968 - 6973
  • [34] Approximate Continuous Top-k Query over Sliding Window
    Rui Zhu
    Bin Wang
    Shi-Ying Luo
    Xiao-Chun Yang
    Guo-Ren Wang
    [J]. Journal of Computer Science and Technology, 2017, 32 : 93 - 109
  • [35] Approximate Continuous Top-k Query over Sliding Window
    Zhu, Rui
    Wang, Bin
    Luo, Shi-Ying
    Yang, Xiao-Chun
    Wang, Guo-Ren
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (01) : 93 - 109
  • [36] Reverse Top-k Queries
    Vlachou, Akrivi
    Doulkeridis, Christos
    Kotidis, Yannis
    Norvag, Kjetil
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 365 - 376
  • [37] Sketching distributed sliding-window data streams
    Odysseas Papapetrou
    Minos Garofalakis
    Antonios Deligiannakis
    [J]. The VLDB Journal, 2015, 24 : 345 - 368
  • [38] Finding top-k elements in a time-sliding window
    Homem N.
    Carvalho J.P.
    [J]. Evolving Systems, 2011, 2 (01) : 51 - 70
  • [39] Distributed probabilistic top-k dominating queries over uncertain databases
    Niranjan Rai
    Xiang Lian
    [J]. Knowledge and Information Systems, 2023, 65 : 4939 - 4965
  • [40] Distributed probabilistic top-k dominating queries over uncertain databases
    Rai, Niranjan
    Lian, Xiang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (11) : 4939 - 4965