Strong consistency of the good-turing estimator

被引:9
|
作者
Wagner, Aaron B. [1 ]
Viswanath, Pramod [1 ,2 ]
Kulkarni, Sanjeev R. [3 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, 1101 W Springfield Ave, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Elect Engn, Urbana, IL 61801 USA
[3] Princeton Univ, Dept EE, Princeton, NJ USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ISIT.2006.262066
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
We consider the problem of estimating the total probability of all symbols that appear with a given frequency in a string of i.i.d. random variables with unknown distribution. We focus on the regime in which the block length is large yet no symbol appears frequently in the string. This is accomplished by allowing the distribution to change with the block length. Under a natural convergence assumption on the sequence of underlying distributions, we show that the total probabilities converge to a deterministic limit, which we characterize. We then show that the Good-Turing total probability estimator is strongly consistent.
引用
下载
收藏
页码:2526 / +
页数:2
相关论文
共 50 条
  • [11] Competitive Distribution Estimation: Why is Good-Turing Good
    Orlitsky, Alon
    Suresh, Ananda Theertha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [12] Rediscovery of Good-Turing estimators via Bayesian Nonparametrics
    Favaro, Stefano
    Nipoti, Bernardo
    Teh, Yee Whye
    BIOMETRICS, 2016, 72 (01) : 136 - 145
  • [13] Generalized Good-Turing Improves Missing Mass Estimation
    Painsky, Amichai
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 1890 - 1899
  • [14] 一种增强的Good-Turing方法
    毛隽
    何炎祥
    刘娟
    成钢
    武汉大学学报(理学版), 2007, (05) : 527 - 530
  • [15] On the Application of Good-Turing Statistics to Quantify Convergence of Biomolecular Simulations
    Koukos, Panagiotis I.
    Glykos, Nicholas M.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (01) : 209 - 217
  • [16] Good-turing estimation of the number of operating sensors: A large deviations analysis
    Budianu, C
    Tong, L
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING SIGNAL PROCESSING THEORY AND METHODS, 2004, : 1029 - 1032
  • [17] Good-turing estimation from word lattices for unsupervised language model adaptation
    Riley, M
    Roark, B
    Sproat, R
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 453 - 458
  • [18] Study on good-turing and a novel smoothing method based on real corpora for language models
    Huang, FL
    Yu, MS
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 3741 - 3745
  • [19] Strong consistency of the general rank estimator
    Bindele, Huybrechts F.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (02) : 532 - 539
  • [20] Deciphering the enigma of undetected species, phylogenetic, and functional diversity based on Good-Turing theory
    Chao, Anne
    Chiu, Chun-Huo
    Colwell, Robert K.
    Magnago, Luiz Fernando S.
    Chazdon, Robin L.
    Gotelli, Nicholas J.
    ECOLOGY, 2017, 98 (11) : 2914 - 2929