Selective conformal inference with false coverage-statement rate control

Cited by: 2
Authors
Bao, Yajie [1]
Huo, Yuyang [2]
Ren, Haojie [1]
Zou, Changliang [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
[2] Nankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Conditional empirical distribution; Distribution-free; Nonexchangeable condition; Post-selection inference; Prediction interval; Split conformal; DISCOVERY RATE; CONFIDENCE-INTERVALS; STEPUP PROCEDURES; PREDICTION; POINT;
DOI
10.1093/biomet/asae010
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Conformal inference is a popular tool for constructing prediction intervals. We consider here the scenario of post-selection/selective conformal inference, that is, prediction intervals are reported only for individuals selected from unlabelled test data. To account for multiplicity, we develop a general split conformal framework to construct selective prediction intervals with false coverage-statement rate control. We first investigate the false coverage rate-adjusted method in the present setting, and show that it is able to achieve false coverage-statement rate control, but yields uniformly inflated prediction intervals. We then propose a novel solution to the problem called selective conditional conformal prediction. Our method performs selection procedures on both the calibration set and the test set, and then constructs conformal prediction intervals for the selected test candidates with the aid of the conditional empirical distribution obtained from the post-selection calibration set. When the selection rule is exchangeable, we show that our proposed method can exactly control the false coverage-statement rate with a model-free and distribution-free guarantee. For nonexchangeable selection procedures involving the calibration set, we provide non-asymptotic bounds for the false coverage-statement rate under mild distributional assumptions. Numerical results confirm the effectiveness and robustness of our method in controlling the false coverage-statement rate and show that it achieves narrower prediction intervals than existing methods across various settings.
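The two-step idea in the abstract (apply the same selection to the calibration and test sets, then calibrate on the selected calibration points only) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes absolute-residual conformity scores and a selection rule that is a fixed function of the covariate alone, and the names `conformal_quantile`, `selective_conditional_intervals`, `predict` and `select` are hypothetical.

```python
import math


def conformal_quantile(scores, alpha):
    """Return the ceil((n + 1) * (1 - alpha))-th smallest score, or inf if n is too small."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf
    return sorted(scores)[k - 1]


def selective_conditional_intervals(predict, select, calib, test_x, alpha):
    """Prediction intervals for the selected test points only.

    predict: fitted point predictor, x -> float (trained on separate data)
    select:  selection rule, x -> bool (assumed here to depend on x alone)
    calib:   list of labelled (x, y) calibration pairs
    test_x:  list of unlabelled test covariates
    """
    # Step 1: apply the same selection rule to the calibration set.
    selected_calib = [(x, y) for x, y in calib if select(x)]
    # Step 2: conformity scores from the selected calibration points only,
    # i.e. the empirical score distribution conditional on selection.
    scores = [abs(y - predict(x)) for x, y in selected_calib]
    q = conformal_quantile(scores, alpha)
    # Step 3: report intervals only for the selected test points.
    return {
        j: (predict(x) - q, predict(x) + q)
        for j, x in enumerate(test_x)
        if select(x)
    }


# Toy usage with a hypothetical predictor and a fixed-threshold selection rule.
predict = lambda x: 2 * x
select = lambda x: predict(x) >= 1.0
calib = [(0.25, 0.5), (0.5, 1.25), (0.75, 1.0), (1.0, 2.75), (1.5, 3.0), (2.0, 3.0)]
test_x = [0.25, 1.0, 2.0]
intervals = selective_conditional_intervals(predict, select, calib, test_x, alpha=0.5)
print(intervals)
```

In this toy run only the test points at indices 1 and 2 pass the selection, so only they receive intervals. Selection rules that depend on the test sample itself, such as top-K selection, need the more careful treatment described in the abstract and are not covered by this sketch.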
Pages: 727-742
Number of pages: 16
Related papers
50 in total
  • [21] Distributed False Discovery Rate Control with Quantization
    Xiang, Yu
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 246 - 249
  • [22] Contextual Online False Discovery Rate Control
    Chen, Shiyun
    Kasiviswanathan, Shiva Prasad
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 952 - 960
  • [23] The application conditions of false discovery rate control
    Zhang, Hongbin
    Le, Xin
    Xiang, Tingxiu
    GENES & DISEASES, 2023, 10 (04) : 1145 - 1146
  • [24] Symmetric directional false discovery rate control
    Holte, Sarah E.
    Lee, Eva K.
    Mei, Yajun
    STATISTICAL METHODOLOGY, 2016, 33 : 71 - 82
  • [25] Copulas, uncertainty, and false discovery rate control
    Cerqueti, Roy
    Lupi, Claudio
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 100 : 105 - 114
  • [26] MOTION DETECTION WITH FALSE DISCOVERY RATE CONTROL
    McHugh, J. Mike
    Konrad, Janusz
    Saligrama, Venkatesh
    Jodoin, Pierre-Marc
    Castanon, David
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 873 - 876
  • [27] Multiple Attribute Control Charts with False Discovery Rate Control
    Li, Yanting
    Tsung, Fugee
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2012, 28 (08) : 857 - 871
  • [28] Transcriptome data are insufficient to control false discoveries in regulatory network inference
    Kernfeld, Eric
    Keener, Rebecca
    Cahan, Patrick
    Battle, Alexis
    CELL SYSTEMS, 2024, 15 (08)
  • [29] Selective prediction-set models with coverage rate guarantees
    Feng, Jean
    Sondhi, Arjun
    Perry, Jessica
    Simon, Noah
    BIOMETRICS, 2023, 79 (02) : 811 - 825
  • [30] On control of the false discovery rate under no assumption of dependency
    Guo, Wenge
    Rao, M. Bhaskara
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (10) : 3176 - 3188