Distribution-free, Risk-controlling Prediction Sets

被引:38
|
作者
Bates, Stephen [1 ]
Angelopoulos, Anastasios [1 ]
Lei, Lihua [2 ]
Malik, Jitendra [1 ]
Jordan, Michael [1 ]
机构
[1] Univ Calif Berkeley, 387 Soda Hall, Berkeley, CA 94720 USA
[2] Stanford Univ, 390 Serra Mall, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Uncertainty quantification; conformal prediction; predictive uncertainty; set-valued prediction; INEQUALITIES;
D O I
10.1145/3478535
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying learning systems in consequential settings also requires calibrating and communicating the uncertainty of predictions. To convey instance-wise uncertainty for prediction tasks, we show how to generate set-valued predictions from a black-box predictor that controls the expected loss on future test points at a user-specified level. Our approach provides explicit finite-sample guarantees for any dataset by using a holdout set to calibrate the size of the prediction sets. This framework enables simple, distribution-free, rigorous error control for many tasks, and we demonstrate it in five large-scale machine learning problems: (1) classification problems where some mistakes are more costly than others; (2) multi-label classification, where each observation has multiple associated labels; (3) classification problems where the labels have a hierarchical structure; (4) image segmentation, where we wish to predict a set of pixels containing an object of interest; and (5) protein structure prediction. Last, we discuss extensions to uncertainty quantification for ranking, metric learning, and distributionally robust learning.
引用
收藏
页数:34
相关论文
共 50 条
  • [1] Distribution-Free Prediction Sets
    Lei, Jing
    Robins, James
    Wasserman, Larry
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 278 - 287
  • [2] Distribution-Free Prediction Sets for Two-Layer Hierarchical Models
    Dunn, Robin
    Wasserman, Larry
    Ramdas, Aaditya
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2491 - 2502
  • [3] Distribution-free risk analysis
    Gray, Ander
    Ferson, Scott
    Kreinovich, Vladik
    Patelli, Edoardo
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 146 : 133 - 156
  • [4] A distribution-free method for probabilistic prediction
    Shi, Qitao
    Zhang, Ya-Lin
    Yu, Lu
    Zhu, Feng
    Li, Longfei
    Zhou, Jun
    Fang, Yanming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [5] DISTRIBUTION-FREE AND OTHER PREDICTION INTERVALS
    KONIJN, HS
    [J]. AMERICAN STATISTICIAN, 1987, 41 (01): : 11 - 15
  • [6] Utilizing risk-controlling prediction calibration to reduce false alarm rates in epileptic seizure prediction
    Segal, Galya
    Keidar, Noam
    Lotan, Roy Maor
    Romano, Yaniv
    Herskovitz, Moshe
    Yaniv, Yael
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [7] Research on risk-controlling strategy for aeronautic project
    Fu, Q
    Li, XJ
    An, S
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 2289 - 2293
  • [8] Distribution-free prediction intervals with conformal prediction for acoustical estimation
    Khurjekar, Ishan
    Gerstoft, Peter
    [J]. Journal of the Acoustical Society of America, 2024, 156 (04): : 2656 - 2667
  • [9] Adaptive, Distribution-Free Prediction Intervals for Deep Networks
    Kivaranovic, Danijel
    Johnson, Kory D.
    Leeb, Hannes
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4346 - 4355
  • [10] Distribution-free prediction intervals in mixed linear models
    Jiang, JM
    Zhang, WH
    [J]. STATISTICA SINICA, 2002, 12 (02) : 537 - 553