Selective conformal inference with false coverage-statement rate control

被引:2
|
作者
Bao, Yajie [1 ]
Huo, Yuyang [2 ]
Ren, Haojie [1 ]
Zou, Changliang [2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
[2] Nankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional empirical distribution; Distribution-free; Nonexchangeable condition; Post-selection inference; Prediction interval; Split conformal; DISCOVERY RATE; CONFIDENCE-INTERVALS; STEPUP PROCEDURES; PREDICTION; POINT;
D O I
10.1093/biomet/asae010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Conformal inference is a popular tool for constructing prediction intervals. We consider here the scenario of post-selection/selective conformal inference, that is, prediction intervals are reported only for individuals selected from unlabelled test data. To account for multiplicity, we develop a general split conformal framework to construct selective prediction intervals with the false coverage-statement rate control. We first investigate the false coverage rate-adjusted method of in the present setting, and show that it is able to achieve false coverage-statement rate control, but yields uniformly inflated prediction intervals. We then propose a novel solution to the problem called selective conditional conformal prediction. Our method performs selection procedures on both the calibration set and test set, and then constructs conformal prediction intervals for the selected test candidates with the aid of the conditional empirical distribution obtained by the post-selection calibration set. When the selection rule is exchangeable, we show that our proposed method can exactly control the false coverage-statement rate in a model-free and distribution-free guarantee. For nonexchangeable selection procedures involving the calibration set, we provide non-asymptotic bounds for the false coverage-statement rate under mild distributional assumptions. Numerical results confirm the effectiveness and robustness of our method under false coverage-statement rate control and show that it achieves more narrowed prediction intervals over existing methods across various settings.
引用
收藏
页码:727 / 742
页数:16
相关论文
共 50 条
  • [31] False Discovery Rate Control via Data Splitting
    Dai, Chenguang
    Lin, Buyu
    Xing, Xin
    Liu, Jun S.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2503 - 2520
  • [32] Online control of the false discovery rate with decaying memory
    Ramdas, Aaditya
    Yang, Fanny
    Wainwright, Martin J.
    Jordan, Michael, I
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [33] False discovery rate control with e-values
    Wang, Ruodu
    Ramdas, Aaditya
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (03) : 822 - 852
  • [34] Adaptive procedures for directional false discovery rate control
    Leung, Dennis
    Tran, Ninh
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 706 - 741
  • [35] PAPRIKA: Private Online False Discovery Rate Control
    Zhang, Wanrong
    Kamath, Gautam
    Cummings, Rachel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [36] Optimal false discovery rate control for dependent data
    Xie, Jichun
    Cai, T. Tony
    Maris, John
    Li, Hongzhe
    STATISTICS AND ITS INTERFACE, 2011, 4 (04) : 417 - 430
  • [37] False positive rate control for positive unlabeled learning
    Kong, Shuchen
    Shen, Weiwei
    Zheng, Yingbin
    Zhang, Ao
    Pu, Jian
    Wang, Jun
    NEUROCOMPUTING, 2019, 367 : 13 - 19
  • [38] Sequential selection procedures and false discovery rate control
    G'Sell, Max Grazier
    Wager, Stefan
    Chouldechova, Alexandra
    Tibshirani, Robert
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (02) : 423 - 444
  • [39] False discovery rate control via debiased lasso
    Javanmard, Adel
    Javadi, Hamid
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (01): : 1212 - 1253
  • [40] ADAPTIVE FALSE DISCOVERY RATE CONTROL FOR HETEROGENEOUS DATA
    Habiger, Joshua D.
    STATISTICA SINICA, 2017, 27 (04) : 1731 - 1756