CLARO: modeling and processing uncertain data streams

被引:28
|
作者
Tran, Thanh T. L. [1 ]
Peng, Liping [1 ]
Diao, Yanlei [1 ]
McGregor, Andrew [1 ]
Liu, Anna [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
来源
VLDB JOURNAL | 2012年 / 21卷 / 05期
基金
美国国家科学基金会;
关键词
Uncertain data streams; Continuous uncertainty; Data models; Query processing;
D O I
10.1007/s00778-011-0261-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Uncertain data streams, where data are incomplete and imprecise, have been observed in many environments. Feeding such data streams to existing stream systems produces results of unknown quality, which is of paramount concern to monitoring applications. In this paper, we present the claro system that supports stream processing for uncertain data naturally captured using continuous random variables. claro employs a unique data model that is flexible and allows efficient computation. Built on this model, we develop evaluation techniques for relational operators by exploring statistical theory and approximation. We also consider query planning for complex queries given an accuracy requirement. Evaluation results show that our techniques can achieve high performance while satisfying accuracy requirements and outperform state-of-the-art sampling methods.
引用
收藏
页码:651 / 676
页数:26
相关论文
共 50 条
  • [1] CLARO: modeling and processing uncertain data streams
    Thanh T. L. Tran
    Liping Peng
    Yanlei Diao
    Andrew McGregor
    Anna Liu
    [J]. The VLDB Journal, 2012, 21 : 651 - 676
  • [2] Similarity Join Processing on Uncertain Data Streams
    Lian, Xiang
    Chen, Lei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (11) : 1718 - 1734
  • [3] Complex Event Processing on Uncertain Data Streams in Product Manufacturing Process
    Mao, Na
    Tan, Jie
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2015, : 583 - 588
  • [4] Modeling Randomized Data Streams in Caching, Data Processing, and Crawling Applications
    Ahmed, Sarker Tanzir
    Loguinov, Dmitri
    [J]. 2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM), 2015,
  • [5] Efficient clustering of uncertain data streams
    Cheqing Jin
    Jeffrey Xu Yu
    Aoying Zhou
    Feng Cao
    [J]. Knowledge and Information Systems, 2014, 40 : 509 - 539
  • [6] Outlier Detection on Uncertain Data Streams
    Zhu, Bin
    Zhong, Yuling
    Wang, Xite
    Bai, Mei
    [J]. Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2020, 47 (02): : 134 - 140
  • [7] Efficient clustering of uncertain data streams
    Jin, Cheqing
    Yu, Jeffrey Xu
    Zhou, Aoying
    Cao, Feng
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 40 (03) : 509 - 539
  • [8] Probabilistic Skyline Query Processing over Uncertain Data Streams in Edge Computing Environments
    Lai, Chuan-Chi
    Chen, Yan-Lin
    Liu, Chuan-Ming
    Wang, Li-Chun
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Continuous Outlier Detection on Uncertain Data Streams
    Shaikh, Salman Ahmed
    Kitagawa, Hiroyuki
    [J]. 2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [10] PROBABILISTIC QUERYING OVER UNCERTAIN DATA STREAMS
    Dezfuli, Mohammad G.
    Haghjoo, Mostafa S.
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2012, 20 (05) : 701 - 728