Reliability evaluation of individual predictions: a data-centric approach

被引:0
|
作者
Shahbazi, Nima [1 ]
Asudeh, Abolfazl [1 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
来源
VLDB JOURNAL | 2024年 / 33卷 / 04期
基金
美国国家科学基金会;
关键词
VORONOI DIAGRAMS;
D O I
10.1007/s00778-024-00857-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning models only provide probabilistic guarantees on the expected loss of random samples from the distribution represented by their training data. As a result, a model with high accuracy, may or may not be reliable for predicting an individual query point. To address this issue, XAI aims to provide explanations of individual predictions, while approaches such as conformal predictions, probabilistic predictions, and prediction intervals count on the model's certainty in its prediction to identify unreliable cases. Conversely, instead of relying on the model itself, we look for insights in the training data. That is, following the fact a model's performance is limited to the data it has been trained on, we ask "is a model trained on a given data set, fit for making a specific prediction?". Specifically, we argue that a model's prediction is not reliable if (i) there were not enough similar instances in the training set to the query point, and (ii) if there is a high fluctuation (uncertainty) in the vicinity of the query point in the training set. Using these two observations, we propose data-centric reliability measures for individual predictions and develop novel algorithms for efficient and effective computation of the reliability measures during inference time. The proposed algorithms learn the necessary components of the measures from the data itself and are sublinear, which makes them scalable to very large and multi-dimensional settings. Furthermore, an estimator is designed to enable no-data access during the inference time. We conduct extensive experiments using multiple real and synthetic data sets and different tasks, which reflect a consistent correlation between distrust values and model performance.
引用
收藏
页码:1203 / 1230
页数:28
相关论文
共 50 条
  • [1] Data-centric Reliability Management in GPUs
    Kadam, Gurunath
    Smirni, Evgenia
    Jog, Adwait
    [J]. 51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2021), 2021, : 271 - 283
  • [2] A Data-Centric Approach to Synchronization
    Dolby, Julian
    Hammer, Christian
    Marino, Daniel
    Tip, Frank
    Vaziri, Mandana
    Vitek, Jan
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2012, 34 (01):
  • [3] A Data-Centric Approach to Loss Mechanisms
    Senior, Alistair C.
    Miller, Robert J.
    [J]. JOURNAL OF TURBOMACHINERY-TRANSACTIONS OF THE ASME, 2024, 146 (04):
  • [4] A Data-Centric Approach to Change Management
    Nwokeji, Joshua Chibuike
    Clark, Tony
    Barn, Balbir
    Kulkarni, Vinay
    Anum, Sheena O.
    [J]. PROCEEDINGS OF THE 2015 IEEE 19TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE, 2015, : 185 - 190
  • [5] A DATA-CENTRIC APPROACH TO LOSS MECHANISMS
    Senior, Alistair C.
    Miller, Robert J.
    [J]. PROCEEDINGS OF ASME TURBO EXPO 2023: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, GT2023, VOL 13A, 2023,
  • [6] A data-centric approach to distributed tracing
    Popa, Nicolae Marian
    Oprescu, Ana
    [J]. 11TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2019), 2019, : 209 - 216
  • [7] A data-centric perspective on the information needed for hydrological uncertainty predictions
    Auer, Andreas
    Gauch, Martin
    Kratzert, Frederik
    Nearing, Grey
    Hochreiter, Sepp
    Klotz, Daniel
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2024, 28 (17) : 4099 - 4126
  • [8] Data Subsetting: A Data-Centric Approach to Approximate Computing
    Kim, Younghoon
    Venkataramani, Swagath
    Chandrachoodan, Nitin
    Raghunathan, Anand
    [J]. 2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 576 - 581
  • [9] A DATA-CENTRIC APPROACH FOR INTEGRATED DATA CENTER MANAGEMENT
    Hoover, Christopher
    [J]. PROCEEDINGS OF THE ASME PACIFIC RIM TECHNICAL CONFERENCE AND EXHIBITION ON PACKAGING AND INTEGRATION OF ELECTRONIC AND PHOTONIC SYSTEMS, MEMS AND NEMS 2011, VOL 2, 2012, : 565 - 576
  • [10] A data-centric approach for predicting individual outcomes in a multi-party legislative system
    Viola, Roberta
    Drummond, Guilherme
    Veloso, Adriano
    Zuardi, Mauricio
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,