Continuously monitoring top-k uncertain data streams: a probabilistic threshold method

被引:9
|
作者
Hua, Ming [1 ]
Pei, Jian [1 ]
机构
[1] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Uncertain streams; Probabilistic threshold top-k queries; Query processing; SELECTION; QUERIES;
D O I
10.1007/s10619-009-7043-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, uncertain data processing has become more and more important. Although a significant amount of previous research explores various continuous queries on data streams, continuous queries on uncertain data streams have seldom been investigated. In this paper, we formulate a novel and challenging problem of continuously monitoring top-k uncertain data streams, and propose a probabilistic threshold method. We develop four algorithms systematically: a deterministic exact algorithm, a randomized method, and their space-efficient versions using quantile summaries. An extensive empirical study using real data sets and synthetic data sets is reported to verify the effectiveness and the efficiency of our methods.
引用
收藏
页码:29 / 65
页数:37
相关论文
共 50 条
  • [1] Continuously monitoring top-k uncertain data streams: a probabilistic threshold method
    Ming Hua
    Jian Pei
    [J]. Distributed and Parallel Databases, 2009, 26 : 29 - 65
  • [2] Efficiently answering probabilistic threshold top-k queries on uncertain data
    Hua, Ming
    Pei, Jian
    Zhang, Wenjie
    Lin, Xuemin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1403 - +
  • [3] Continuous Monitoring of Top-k Dominating Queries over Uncertain Data Streams
    Li, Guohui
    Luo, Changyin
    Li, Jianjun
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 244 - 255
  • [4] Probabilistic Top-k Dominating Query Monitoring Over Multiple Uncertain IoT Data Streams in Edge Computing Environments
    Lai, Chuan-Chi
    Wang, Tien-Chun
    Liu, Chuan-Ming
    Wang, Li-Chun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05): : 8563 - 8576
  • [5] An efficient algorithm for top-k queries on uncertain data streams
    Dai, Caiyan
    Chen, Ling
    Chen, Yixin
    Tang, Keming
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 294 - 299
  • [6] An Effective Method for Top-k Dominating Query Processing over Multiple Uncertain Data Streams
    Liu, Chuan-Ming
    Wang, Tien-Chun
    Lai, Chuan-Chi
    Wang, Li-Chun
    [J]. 2018 27TH WIRELESS AND OPTICAL COMMUNICATION CONFERENCE (WOCC), 2018, : 91 - 95
  • [7] Continuous Top-k Monitoring on Document Streams
    Hou, Leong U.
    Zhang, Junjie
    Mouratidis, Kyriakos
    Li, Ye
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (05) : 991 - 1003
  • [8] Sliding Window Top-K Monitoring over Distributed Data Streams
    Lv, Zhijin
    Chen, Ben
    Yu, Xiaohui
    [J]. WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 527 - 540
  • [9] Probabilistic top-k dominating queries in uncertain databases
    Lian, Xiang
    Chen, Lei
    [J]. INFORMATION SCIENCES, 2013, 226 : 23 - 46
  • [10] Sliding Window Top-K Monitoring over Distributed Data Streams
    Chen B.
    Lv Z.
    Yu X.
    Liu Y.
    [J]. Data Science and Engineering, 2017, 2 (4) : 289 - 300