Selecting informative conformal prediction sets with false coverage rate control

被引:0
|
作者
Gazin, Ulysse [1 ,2 ]
Heller, Ruth [3 ]
Marandon, Ariane [4 ]
Roquain, Etienne [5 ,6 ]
机构
[1] Univ Paris Cite, F-75013 Paris, France
[2] Sorbonne Univ, CNRS, Lab Probabil Stat & Modelisat, F-75013 Paris, France
[3] Tel Aviv Univ, Dept Stat & Operat Res, Tel Aviv, Israel
[4] Alan Turing Inst, London, England
[5] Sorbonne Univ, F-75005 Paris, France
[6] Univ Paris Cite, CNRS, Lab Probabil Stat & Modelisat, F-75005 Paris, France
关键词
classification; false discovery rate; label shift; prediction interval; regression; selective inference; DISCOVERY RATE;
D O I
10.1093/jrsssb/qkae120
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In supervised learning, including regression and classification, conformal methods provide prediction sets for the outcome/label with finite sample coverage for any machine learning predictor. We consider here the case where such prediction sets come after a selection process. The selection process requires that the selected prediction sets be 'informative' in a well-defined sense. We consider both the classification and regression settings where the analyst may consider as informative only the sample with prediction sets small enough, excluding null values, or obeying other appropriate 'monotone' constraints. We develop a unified framework for building such informative conformal prediction sets while controlling the false coverage rate (FCR) on the selected sample. While conformal prediction sets after selection have been the focus of much recent literature in the field, the new introduced procedures, called InfoSP and InfoSCOP, are to our knowledge the first ones providing FCR control for informative prediction sets. We show the usefulness of our resulting procedures on real and simulated data.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Evaluating the Utility of Conformal Prediction Sets for AI-Advised Image Labeling
    Zhang, Dongping
    Chatzimparmpas, Angelos
    Kamali, Negar
    Hullman, Jessica
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [42] Revisiting Coverage Control in Nonconvex Environments using Visibility Sets
    Klodt, Lukas
    Haumann, Dominik
    Willert, Volker
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 82 - 89
  • [43] On control of the false discovery rate under no assumption of dependency
    Guo, Wenge
    Rao, M. Bhaskara
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (10) : 3176 - 3188
  • [44] False Discovery Rate Control via Data Splitting
    Dai, Chenguang
    Lin, Buyu
    Xing, Xin
    Liu, Jun S.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2503 - 2520
  • [45] Online control of the false discovery rate with decaying memory
    Ramdas, Aaditya
    Yang, Fanny
    Wainwright, Martin J.
    Jordan, Michael, I
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [46] False discovery rate control with e-values
    Wang, Ruodu
    Ramdas, Aaditya
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (03) : 822 - 852
  • [47] Adaptive procedures for directional false discovery rate control
    Leung, Dennis
    Tran, Ninh
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 706 - 741
  • [48] PAPRIKA: Private Online False Discovery Rate Control
    Zhang, Wanrong
    Kamath, Gautam
    Cummings, Rachel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [49] Optimal false discovery rate control for dependent data
    Xie, Jichun
    Cai, T. Tony
    Maris, John
    Li, Hongzhe
    STATISTICS AND ITS INTERFACE, 2011, 4 (04) : 417 - 430
  • [50] False positive rate control for positive unlabeled learning
    Kong, Shuchen
    Shen, Weiwei
    Zheng, Yingbin
    Zhang, Ao
    Pu, Jian
    Wang, Jun
    NEUROCOMPUTING, 2019, 367 : 13 - 19