Good-bootstrap: simultaneous confidence intervals for large alphabet distributions

被引:1
|
作者
Marton, Daniel [1 ]
Painsky, Amichai [2 ,3 ]
机构
[1] Tel Aviv Univ, Stat Dept, Tel Aviv, Israel
[2] Tel Aviv Univ, Ind Engn Dept, Tel Aviv, Israel
[3] Tel Aviv Univ, IL-6997801 Ramat Aviv, Israel
基金
以色列科学基金会;
关键词
Simultaneous confidence intervals; multinomial distribution; good-turing; large alphabet; count data;
D O I
10.1080/10485252.2024.2313706
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Simultaneous confidence intervals (SCI) for multinomial proportions are a corner stone in count data analysis and a key component in many applications. A variety of schemes were introduced over the years, mostly focussing on an asymptotic regime where the sample is large and the alphabet size is relatively small. In this work we introduce a new SCI framework which considers the large alphabet setup. Our proposed framework utilises bootstrap sampling with the Good-Turing probability estimator as a plug-in distribution. We demonstrate the favourable performance of our proposed method in synthetic and real-world experiments. Importantly, we provide an exact analytical expression for the bootstrapped statistic, which replaces the computationally costly sampling procedure. Our proposed framework is publicly available at the first author's Github page.
引用
收藏
页数:15
相关论文
共 50 条