Computing rarity on uncertain data

Authors
CheQing Jin
MinQi Zhou
AoYing Zhou
Affiliation
[1] East China Normal University, Shanghai Key Laboratory of Trustworthy Computing, Software Engineering Institute
Keywords
rarity; uncertain data; possible world
DOI
Not available
Abstract
Uncertain data management has attracted wide attention because data uncertainty arises in many applications, such as the Web and sensor networks. Most uncertain data models are based on the possible world semantics. Because the number of possible worlds grows exponentially with the size of the data set, handling uncertain data is much more challenging than handling deterministic data. In this paper, we make the first attempt to study rarity, an important statistic that describes the proportion of items with the same frequency, on uncertain data. We propose three novel solutions: an exact method and an approximate method for computing the rarity of a given frequency, and a method for finding the frequency with the maximum rarity. Theoretical analysis and extensive experimental results demonstrate the effectiveness and efficiency of the proposed solutions.
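To make the exponential blow-up concrete, here is a minimal brute-force sketch in Python. It is not one of the paper's three methods. It assumes a simple tuple-level model in which each `(item, probability)` pair exists independently, and it assumes that rarity on uncertain data is taken as the expected value over all possible worlds; the abstract does not state either detail. The function names `rarity` and `expected_rarity` are hypothetical.

```python
from collections import Counter
from itertools import product


def rarity(items, alpha):
    """Fraction of distinct items that occur exactly `alpha` times."""
    counts = Counter(items)
    if not counts:
        return 0.0  # assumed convention for an empty world
    return sum(1 for c in counts.values() if c == alpha) / len(counts)


def expected_rarity(uncertain, alpha):
    """Expected alpha-rarity over all possible worlds (brute force).

    `uncertain` is a list of (item, probability) pairs; each pair is
    assumed to be present independently with the given probability.
    """
    total = 0.0
    # Every tuple is either present or absent: 2**n possible worlds.
    for mask in product((False, True), repeat=len(uncertain)):
        prob = 1.0
        world = []
        for present, (item, p) in zip(mask, uncertain):
            prob *= p if present else 1.0 - p
            if present:
                world.append(item)
        total += prob * rarity(world, alpha)
    return total


if __name__ == "__main__":
    data = [("a", 1.0), ("a", 0.5), ("b", 1.0)]
    print(expected_rarity(data, 1))
    print(expected_rarity(data, 2))
```

The loop visits `2**n` worlds for `n` uncertain tuples, so this sketch is only usable for very small inputs, which is the difficulty the abstract points to when it motivates dedicated exact and approximate methods.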
Pages: 2028-2039
Page count: 11