Extensive benchmarking of a method that estimates external model performance from limited statistical characteristics

Cited by: 0

Authors:

El-Hay, Tal [1]

Reps, Jenna M. [2]

Yanover, Chen [1]

Affiliations:

[1] KI Res Inst, Kfar Malal, Israel

[2] Janssen Res & Dev, Raritan, NJ USA

Source:

NPJ DIGITAL MEDICINE | 2025 / Volume 8 / Issue 01

Keywords:

All Open Access; Gold;

DOI:

10.1038/s41746-024-01414-z

Chinese Library Classification (CLC) number:

R19 [Health care organization and services (health services administration)];

Discipline classification code:

Abstract:

Predictive model performance may deteriorate when a model is applied to data sources that were not used for training; external validation is therefore a key step in successful model deployment. As access to patient-level external data sources is typically limited, we recently proposed a method that estimates external model performance using only external summary statistics. Here, we benchmark the proposed method on multiple tasks using five large heterogeneous US data sources, where each, in turn, plays the role of the internal source and the remaining sources serve as external ones. Results showed accurate estimations for all metrics: the 95th error percentiles for the area under the receiver operating characteristic curve (discrimination), calibration-in-the-large (calibration), and Brier and scaled Brier scores (overall accuracy) were 0.03, 0.08, 0.0002, and 0.07, respectively. These results demonstrate the feasibility of estimating the transportability of prediction models using an internal cohort and external statistics. The method may become an important accelerator of model deployment.
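For readers unfamiliar with the four metrics named in the abstract, the following is a minimal Python sketch of their conventional definitions, computed from patient-level labels and predicted risks. It does not reproduce the authors' estimation method, which works from external summary statistics only. The function names are hypothetical, and the definition of calibration-in-the-large used here (mean observed outcome minus mean predicted risk) is an assumption; other conventions exist, and the abstract does not say which one the paper uses.

```python
import numpy as np

def auroc(y_true, y_prob):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    y_true = np.asarray(y_true, dtype=bool)
    y_prob = np.asarray(y_prob, dtype=float)
    pos = y_prob[y_true]
    neg = y_prob[~y_true]
    # Compare every positive-case score with every negative-case score.
    diff = pos[:, None] - neg[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(wins / (pos.size * neg.size))

def calibration_in_the_large(y_true, y_prob):
    """ASSUMED definition: mean observed outcome minus mean predicted risk."""
    return float(np.mean(y_true) - np.mean(y_prob))

def brier(y_true, y_prob):
    """Mean squared difference between predicted risk and observed outcome."""
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    return float(np.mean((y_prob - y_true) ** 2))

def scaled_brier(y_true, y_prob):
    """1 - Brier / Brier_ref, where Brier_ref is the score of always
    predicting the outcome prevalence p, i.e. p * (1 - p)."""
    p = float(np.mean(y_true))
    return 1.0 - brier(y_true, y_prob) / (p * (1.0 - p))

# Toy data, purely illustrative (not from the paper).
y = [0, 0, 1, 1]
risk = [0.1, 0.4, 0.35, 0.8]
metrics = {
    "auroc": auroc(y, risk),
    "citl": calibration_in_the_large(y, risk),
    "brier": brier(y, risk),
    "scaled_brier": scaled_brier(y, risk),
}
```

The pairwise AUROC computation uses memory proportional to the number of positive cases times the number of negative cases, so it suits small illustrations only; a rank-based implementation would be preferable on large cohorts.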


Pages: 10
