Active Bayesian Assessment of Black-Box Classifiers

被引:0
|
作者
Ji, Disi [1 ]
Logan, Robert L. [1 ]
Smyth, Padhraic [1 ]
Steyvers, Mark [2 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92717 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an active Bayesian approach for assessment of classifier performance to satisfy the desiderata of both reliability and label-efficiency. We begin by developing inference strategies to quantify uncertainty for common assessment metrics such as accuracy, misclassification cost, and calibration error. We then propose a general framework for active Bayesian assessment using inferred uncertainty to guide efficient selection of instances for labeling, enabling better performance assessment with fewer labels. We demonstrate significant gains from our proposed active Bayesian approach via a series of systematic empirical experiments assessing the performance of modern neural classifiers (e.g., ResNet and BERT) on several standard image and text classification datasets.
引用
收藏
页码:7935 / 7944
页数:10
相关论文
共 50 条
  • [21] THE BLACK-BOX
    KYLE, SA
    NEW SCIENTIST, 1986, 110 (1512) : 61 - 61
  • [22] THE BLACK-BOX
    WISEMAN, J
    ECONOMIC JOURNAL, 1991, 101 (404): : 149 - 155
  • [23] Best-Effort Adversarial Approximation of Black-Box Malware Classifiers
    Ali, Abdullah
    Eshete, Birhanu
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT I, 2020, 335 : 318 - 338
  • [24] Black-Box Adversarial Attack for Deep Learning Classifiers in IoT Applications
    Singh, Abhijit
    Sikdar, Biplab
    2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,
  • [25] Defending Black-Box Skeleton-Based Human Activity Classifiers
    Wang, He
    Diao, Yunfeng
    Tan, Zichang
    Guo, Guodong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2546 - 2554
  • [26] Black-Box Assessment of Optical Spectrum Services
    Kaeval, Kaida
    Elbers, Joerg-Peter
    Grobe, Klaus
    Tikas, Marko
    Fehenberger, Tobias
    Griesser, Helmut
    Jervan, Gert
    2021 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2021,
  • [27] Experimental Study on Generating Multi-modal Explanations of Black-box Classifiers in terms of Gray-box Classifiers
    Alonso, Jose M.
    Toja-Alamancos, J.
    Bugarin, A.
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [28] Differential Assessment of Black-Box AI Agents
    Nayyar, Rashmeet Kaur
    Verma, Pulkit
    Srivastava, Siddharth
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9868 - 9876
  • [30] Safety Assessment: From Black-Box to White-Box
    Kurzidem, Iwo
    Misik, Adam
    Schleiss, Philipp
    Burton, Simon
    2022 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2022), 2022, : 295 - 300