Distribution-free data density estimation in large-scale networks

被引:0
|
作者
Minqi Zhou
Rong Zhang
Weining Qian
Aoying Zhou
机构
[1] East China Normal University,Data Science and Engineering Institute
[2] Wuhan University,State Key Lab of Software Engineering
来源
关键词
distribution-free; data density estimation; random sampling;
D O I
暂无
中图分类号
学科分类号
摘要
Estimating the global data distribution in large-scale networks is an important issue and yet to be well addressed. It can benefit many applications, especially in the cloud computing era, such as load balancing analysis, query processing, and data mining. Inspired by the inversion method for random variate (number) generation, in this paper, we present a novel model called distribution-free data density estimation for large ring-based networks to achieve high estimation accuracy with low estimation cost regardless of the distribution models of the underlying data. This model generates random samples for any arbitrary distribution by sampling the global cumulative distribution function and is free from sampling bias. Armed with this estimation method, we can estimate data densities over both one-dimensional and multidimensional tuple sets, where each dimension could be either continuous or discrete as its domain. In large-scale networks, the key idea for distribution-free estimation is to sample a small subset of peers for estimating the global data distribution over the data domain. Algorithms on computing and sampling the global cumulative distribution function based on which the global data distribution is estimated are introduced with a detailed theoretical analysis. Our extensive performance study confirms the effectiveness and efficiency of our methods in large ring-based networks.
引用
收藏
页码:1220 / 1240
页数:20
相关论文
共 50 条
  • [41] Subnetwork estimation for spatial autoregressive models in large-scale networks*
    Li, Xuetong
    Wang, Feifei
    Lan, Wei
    Wang, Hansheng
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (01): : 1768 - 1805
  • [42] Demand estimation for perimeter control in large-scale traffic networks
    Kumarage, Sakitha
    Yildirimoglu, Mehmet
    Zheng, Zuduo
    [J]. 2023 8TH INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS, MT-ITS, 2023,
  • [43] Cooperative Bayesian Estimation of Vehicular Traffic in Large-Scale Networks
    Pascale, Alessandra
    Nicoli, Monica
    Spagnolini, Umberto
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (05) : 2074 - 2088
  • [44] Estimation of a Population Size in Large-Scale Wireless Sensor Networks
    彭绍亮
    李姗姗
    廖湘科
    彭宇行
    肖侬
    [J]. Journal of Computer Science & Technology, 2009, 24 (05) : 987 - 997
  • [45] Estimation of a Population Size in Large-Scale Wireless Sensor Networks
    Shao-Liang Peng
    Shan-Shan Li
    Xiang-Ke Liao
    Yu-Xing Peng
    Nong Xiao
    [J]. Journal of Computer Science and Technology, 2009, 24 : 987 - 997
  • [46] Functional observability and target state estimation in large-scale networks
    Montanari, Arthur N.
    Duan, Chao
    Aguirre, Luis A.
    Motter, Adilson E.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (01)
  • [47] Nonparametric change detection and estimation in large-scale sensor networks
    He, T
    Ben-David, S
    Tong, L
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (04) : 1204 - 1217
  • [48] Estimation of a Population Size in Large-Scale Wireless Sensor Networks
    Peng, Shao-Liang
    Li, Shan-Shan
    Liao, Xiang-Ke
    Peng, Yu-Xing
    Xiao, Nong
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, 24 (05) : 987 - 997
  • [49] LARGE-SCALE DENSITY DISTRIBUTION IN THERMAL RADIO-SOURCES
    GULYAEV, SA
    MENSHCHIKOV, AB
    [J]. ASTRONOMICHESKII ZHURNAL, 1981, 58 (06): : 1207 - 1212
  • [50] Distribution-free dispersion tests for data with ties
    Edwardes, MDDB
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2001, 13 (03) : 311 - 330