Confidence interval construction in massive data sets

被引:0
|
作者
Song, Kai [1 ,2 ]
Xie, Xiaoyue [3 ]
Shi, Jian [1 ,4 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing, Peoples R China
[2] Beijing Inst Technol, Sch Management & Econ, Beijing, Peoples R China
[3] Air Force Engn Univ, Equipment Management & UAV Engn Coll, Xian, Peoples R China
[4] Univ Chinese Acad Sci, Sch Math Sci, Beijing, Peoples R China
关键词
Confidence interval; coverage and length; divide and conquer; hypothesis testing; massive data; optimal local power;
D O I
10.1080/03610926.2022.2100420
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper treats the problem of constructing confidence interval with massive data. Two methods using the divide and conquer strategy are proposed. The corresponding hypothesis testing problem is also developed. Theoretical and numerical results are presented to illustrate their effectiveness.
引用
收藏
页码:1035 / 1048
页数:14
相关论文
共 50 条
  • [31] Including systematic uncertainties in confidence interval construction for Poisson statistics
    Conrad, J
    Botner, O
    Hallgren, A
    de los Heros, CP
    [J]. PHYSICAL REVIEW D, 2003, 67 (01)
  • [32] Point-Interval-Valued Sets: Aggregation and Construction
    Bodjanova, Slavka
    Kalina, Martin
    [J]. AGGREGATION FUNCTIONS IN THEORY AND IN PRACTICE, 2018, 581 : 9 - 20
  • [33] Statistical strategies for the analysis of massive data sets
    Hwang, Hon
    Ryan, Louise
    [J]. BIOMETRICAL JOURNAL, 2020, 62 (02) : 270 - 281
  • [34] Mining knowledge in astrophysical massive data sets
    Brescia, Massimo
    Longo, Giuseppe
    Pasian, Fabio
    [J]. NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2010, 623 (02): : 845 - 849
  • [35] Not just for the birds - Archiving massive data sets
    Gorder, PF
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2006, 8 (03) : 3 - 4
  • [36] Modeling and analyzing massive terrain data sets
    Agarwal, Pankaj K.
    [J]. ALGORITHMS AND COMPUTATION, 2007, 4835 : 1 - 1
  • [37] Segmented regression estimators for massive data sets
    Natarajan, R
    Pednault, E
    [J]. PROCEEDINGS OF THE SECOND SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2002, : 566 - 582
  • [38] Warehousing and mining massive RFID data sets
    Han, Jiawei
    Gonzalez, Hector
    Li, Xiaolei
    Klabjan, Diego
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 1 - 18
  • [39] A computational study of DEA with massive data sets
    Dula, J. H.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2008, 35 (04) : 1191 - 1203
  • [40] Confidence interval methods for antimicrobial resistance surveillance data
    Erta Kalanxhi
    Gilbert Osena
    Geetanjali Kapoor
    Eili Klein
    [J]. Antimicrobial Resistance & Infection Control, 10