Post-hoc Estimators for Learning to Defer to an Expert

Authors
Narasimhan, Harikrishna [1]
Jitkrittum, Wittawat [2]
Menon, Aditya Krishna [2]
Rawat, Ankit Singh [2]
Kumar, Sanjiv [2]
Affiliations
[1] Google Res, Mountain View, CA 94043 USA
[2] Google Res, New York, NY USA
Keywords
CLASSIFICATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many practical settings allow a classifier to defer predictions to one or more costly experts. For example, the learning to defer paradigm allows a classifier to defer to a human expert, at some monetary cost. Similarly, the adaptive inference paradigm allows a base model to defer to one or more large models, at some computational cost. The goal in these settings is to learn classification and deferral mechanisms to optimise a suitable accuracy-cost tradeoff. To achieve this, a central issue studied in prior work is the design of a coherent loss function for both mechanisms. In this work, we demonstrate that existing losses can underfit the training set when there is a non-trivial deferral cost, owing to an implicit application of a high level of label smoothing. To resolve this, we propose two post-hoc estimators that fit a deferral function on top of a base model, either by threshold correction, or by learning when the base model's error rate exceeds the cost of deferring to the expert. Both approaches are equipped with theoretical guarantees, and empirically yield effective accuracy-cost tradeoffs on learning to defer and adaptive inference benchmarks.
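As a rough illustration of the deferral idea described in the abstract, the sketch below defers whenever the base model's estimated error exceeds the expert's expected loss plus the deferral cost. It is a simplified plug-in rule, not the paper's estimators: the function names `should_defer` and `route`, the `expert_acc` parameter, and the use of one minus the maximum class probability as the error estimate are all assumptions made for this example.

```python
from typing import List, Sequence


def should_defer(probs: Sequence[float], expert_acc: float, cost: float) -> bool:
    """Return True when deferring is estimated to be cheaper than predicting.

    probs: base model's class-probability estimates for one input.
    expert_acc: assumed probability that the expert is correct (hypothetical).
    cost: deferral cost, in the same units as a misclassification.
    """
    # Estimated base-model error: one minus its top class probability.
    model_error = 1.0 - max(probs)
    # Estimated loss of deferring: expert error plus the deferral cost.
    expert_loss = (1.0 - expert_acc) + cost
    return model_error > expert_loss


def route(batch: Sequence[Sequence[float]], expert_acc: float, cost: float) -> List[str]:
    """Label each input in the batch as 'defer' or 'predict'."""
    return [
        "defer" if should_defer(p, expert_acc, cost) else "predict"
        for p in batch
    ]


if __name__ == "__main__":
    # Two inputs: one confident prediction, one uncertain prediction.
    batch = [[0.9, 0.1], [0.55, 0.45]]
    print(route(batch, expert_acc=1.0, cost=0.2))
```

Raising `cost` makes deferral less attractive, so fewer inputs are routed to the expert. This is the accuracy-cost tradeoff the abstract refers to, shown here only in its simplest form.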
Pages: 13