Selection and Aggregation of Conformal Prediction Sets

被引:0
|
作者
Yang, Yachong [1 ]
Kuchibhotla, Arun Kumar [2 ]
机构
[1] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[2] Carnegie Mellon Univ, Dept Stat & Data Sci, Pittsburgh, PA USA
关键词
Cross-validation; DKW inequality; Oracle inequality; Quantile function; Ridge regression;
D O I
10.1080/01621459.2024.2344700
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Conformal prediction is a generic methodology for finite-sample valid distribution-free prediction. This technique has garnered a lot of attention in the literature partly because it can be applied with any machine learning algorithm that provides point predictions to yield valid prediction regions. Of course, the efficiency (width/volume) of the resulting prediction region depends on the performance of the machine learning algorithm. In the context of point prediction, several techniques (such as cross-validation) exist to select one of many machine learning algorithms for better performance. In contrast, such selection techniques are seldom discussed in the context of set prediction (or prediction regions). In this article, we consider the problem of obtaining the smallest conformal prediction region given a family of machine learning algorithms. We provide two general-purpose selection algorithms and consider coverage as well as width properties of the final prediction region. The first selection method yields the smallest width prediction region among the family of conformal prediction regions for all sample sizes but only has an approximate coverage guarantee. The second selection method has a finite sample coverage guarantee but only attains close to the smallest width. The approximate optimal width property of the second method is quantified via an oracle inequality. As an illustration, we consider the use of aggregation of nonparametric regression estimators in the split conformal method with the absolute residual conformal score. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Stable Conformal Prediction Sets
    Ndiaye, Eugene
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [2] On the Expected Size of Conformal Prediction Sets
    Dhillon, Guneet S.
    Deligiannidis, George
    Rainforth, Tom
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [3] Bayesian Optimization with Conformal Prediction Sets
    Stanton, Samuel
    Maddox, Wesley
    Wilson, Andrew Gordon
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [4] Conformal Prediction Sets for Ordinal Classification
    Dey, Prasenjit
    Merugu, Srujana
    Kaveri, Sivaramakrishnan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Conformal Prediction Sets with Limited False Positives
    Fisch, Adam
    Schuster, Tal
    Jaakkola, Tommi
    Barzilay, Regina
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] Selection by Prediction with Conformal p-values
    Jin, Ying
    Candes, Emmanuel J.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [7] Evidential Uncertainty Sets in Deep Classifiers Using Conformal Prediction
    Karimi, Hamed
    Samavi, Reza
    13TH SYMPOSIUM ON CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, 2024, 230 : 466 - 489
  • [8] NESTED CONFORMAL PREDICTION SETS FOR CLASSIFICATION WITH APPLICATIONS TO PROBATION DATA
    Kuchibhotla, Arun K.
    Berk, Richard A.
    ANNALS OF APPLIED STATISTICS, 2023, 17 (01): : 761 - 785
  • [9] Multi-split conformal prediction via Cauchy aggregation
    Wu, Xiaoyang
    Huo, Yuyang
    Zou, Changliang
    STAT, 2023, 12 (01):
  • [10] Selecting informative conformal prediction sets with false coverage rate control
    Gazin, Ulysse
    Heller, Ruth
    Marandon, Ariane
    Roquain, Etienne
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2025,