Computing rarity on uncertain data

被引:1
|
作者
Jin CheQing [1 ]
Zhou MinQi [1 ]
Zhou AoYing [1 ]
机构
[1] E China Normal Univ, Inst Software Engn, Shanghai Key Lab Trustworthy Comp, Shanghai 200062, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
rarity; uncertain data; possible world; FRAMEWORK;
D O I
10.1007/s11432-011-4378-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The essence of uncertain data management has been well adopted since data uncertainty widely exists in lots of applications, such as Web, sensor networks, etc. Most of the uncertain data models are based on the possible world semantics. Because the number of the possible worlds will blowup exponentially with the growth of the data set, it is much more challenging to handle uncertain data than deterministic data. In this paper, we take the first attempt to study the rarity, an important statistic that describes the proportion of items with the same frequency, upon uncertain data. We have proposed three novel solutions, including an exact method and an approximate method to compute the rarity of a given frequency respectively, and a method to find the frequency of the maximum rarity. Analysis in theorem and extensive experimental results demonstrate the effectiveness and efficiency of the proposed solutions.
引用
收藏
页码:2028 / 2039
页数:12
相关论文
共 50 条
  • [1] Computing rarity on uncertain data
    JIN CheQing
    [J]. Science China(Information Sciences), 2011, 54 (10) : 2028 - 2039
  • [2] Computing rarity on uncertain data
    CheQing Jin
    MinQi Zhou
    AoYing Zhou
    [J]. Science China Information Sciences, 2011, 54 : 2028 - 2039
  • [3] Computing All Skyline Probabilities for Uncertain Data
    Atallah, Mikhail J.
    Qi, Yinian
    [J]. PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2009, : 279 - 287
  • [4] A note on computing the center of uncertain data on the real line
    Wang, Haitao
    Zhang, Jingru
    [J]. OPERATIONS RESEARCH LETTERS, 2016, 44 (03) : 370 - 373
  • [5] Online Computing Quantile Summaries Over Uncertain Data Streams
    Liang, Chunquan
    Li, Mei
    Liu, Bin
    [J]. IEEE ACCESS, 2019, 7 : 10916 - 10926
  • [6] Computing a k-route over uncertain geographical data
    Safra, Eliyahu
    Kanza, Yaron
    Dolev, Nir
    Sagiv, Yehoshua
    Doytsher, Yerach
    [J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2007, 4605 : 276 - +
  • [7] Computing the Probabilities of Operations in Vector Models for Uncertain Spatial Data
    Tossebro, Erlend
    Nygard, Mads
    [J]. SITIS 2008: 4TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY AND INTERNET BASED SYSTEMS, PROCEEDINGS, 2008, : 78 - +
  • [8] Parallel skyline queries over uncertain data streams in cloud computing environments
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Yuan
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2014, 10 (01) : 24 - 53
  • [9] Stromal tumor of the prostate of uncertain malignant potential. A diagnostic rarity
    Nikolov, N.
    Kramm, T.
    Haroske, G.
    Gieseler, B.
    Steinbach, F.
    [J]. UROLOGE, 2008, 47 (06): : 753 - 756
  • [10] Investigating class rarity in big data
    Tawfiq Hasanin
    Taghi M. Khoshgoftaar
    Joffrey L. Leevy
    Richard A. Bauder
    [J]. Journal of Big Data, 7