When best is the enemy of good - critical evaluation of performance criteria in hydrological models

被引：9

作者：

Cinkus, Guillaume ^{[1
]}

Mazzilli, Naomi ^{[2
]}

Jourde, Herve ^{[1
]}

Wunsch, Andreas ^{[3
]}

Liesch, Tanja ^{[3
]}

Ravbar, Natasa ^{[4
]}

Chen, Zhao ^{[5
]}

Goldscheider, Nico ^{[3
]}

机构：

[1] Univ Montpellier, Hydrosci Montpellier HSM, CNRS, IRD, F-34090 Montpellier, France

[2] Univ Avignon, UMR EMMAH AU INRAE 1114, F-84000 Avignon, France

[3] Karlsruhe Inst Technol KIT, Inst Appl Geosci, Kaiserstr 12, D-76131 Karlsruhe, Germany

[4] ZRC SAZU, Karst Res Inst, Titov Trg 2, Postojna 6230, Slovenia

[5] Tech Univ Dresden, Inst Groundwater Management, D-01062 Dresden, Germany

来源：

HYDROLOGY AND EARTH SYSTEM SCIENCES | 2023年 / 27卷 / 13期

关键词：

CALIBRATION; SENSITIVITY; EFFICIENCY; MULTIPLE; METRICS; ERROR;

D O I：

10.5194/hess-27-2397-2023

中图分类号：

P [天文学、地球科学];

学科分类号：

07 ;

摘要：

Performance criteria play a key role in the calibration and evaluation of hydrological models and have been extensively developed and studied, but some of the most used criteria still have unknown pitfalls. This study set out to examine counterbalancing errors, which are inherent to the Kling-Gupta efficiency (KGE) and its variants. A total of nine performance criteria - including the KGE and its variants, as well as the Nash-Sutcliffe efficiency (NSE) and the modified index of agreement (d(1)) - were analysed using synthetic time series and a real case study. Results showed that, when assessing a simulation, the score of the KGE and some of its variants can be increased by concurrent overestimation and underestimation of discharge. These counterbalancing errors may favour bias and variability parameters, therefore preserving an overall high score of the performance criteria. As bias and variability parameters generally account for two-thirds of the weight in the equation of performance criteria such as the KGE, this can lead to an overall higher criterion score without being associated with an increase in model relevance. We recommend using (i) performance criteria that are not or less prone to counterbalancing errors (d(1), modified KGE, non-parametric KGE, diagnostic efficiency) and/or (ii) scaling factors in the equation to reduce the influence of relative parameters.

引用

下载

页码：2397 / 2411

页数：15

共 50 条

[31] When Does It Pay to be Good? Moderators and Mediators in the Corporate Sustainability-Corporate Financial Performance Relationship: A Critical Review
Grewatsch, Sylvia
Kleindienst, Ingo
JOURNAL OF BUSINESS ETHICS, 2017, 145 (02) : 383 - 416
[32] Performance evaluation of FAO Penman-Monteith and best alternative models for estimating reference evapotranspiration in Bangladesh
Islam, Shakibul
Alam, A. K. M. Rashidul
HELIYON, 2021, 7 (07)
[33] River flow modelling: comparison of performance and evaluation of uncertainty using data-driven models and conceptual hydrological model
Zhang, Zhenghao
Zhang, Qiang
Singh, Vijay P.
Shi, Peijun
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2018, 32 (09) : 2667 - 2682
[34] River flow modelling: comparison of performance and evaluation of uncertainty using data-driven models and conceptual hydrological model
Zhenghao Zhang
Qiang Zhang
Vijay P. Singh
Peijun Shi
Stochastic Environmental Research and Risk Assessment, 2018, 32 : 2667 - 2682
[35] A time series tool to support the multi-criteria performance evaluation of rainfall-runoff models
Willems, Patrick
ENVIRONMENTAL MODELLING & SOFTWARE, 2009, 24 (03) : 311 - 321
[36] Evaluation of polygenic risk models using multiple performance measures: a critical assessment of discordant results
Martens, Forike K.
Tonk, Elisa C. M.
Janssens, A. Cecile J. W.
GENETICS IN MEDICINE, 2019, 21 (02) : 391 - 397
[37] Flood spatial coherence, triggers, and performance in hydrological simulations: large-sample evaluation of four streamflow-calibrated models
Brunner, Manuela, I
Melsen, Lieke A.
Wood, Andrew W.
Rakovec, Oldrich
Mizukami, Naoki
Knoben, Wouter J. M.
Clark, Martyn P.
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2021, 25 (01) : 105 - 119
[38] Comparative evaluation and performance of large language models on expert level critical care questions: a benchmark study
Jessica D. Workum
Bas W. S. Volkers
Davy van de Sande
Sumesh Arora
Marco Goeijenbier
Diederik Gommers
Michel E. van Genderen
Critical Care, 29 (1):
[39] Critical evaluation of two models of flow cytometers for the assessment of sperm DNA fragmentation: an appeal for performance verification
Sharma, Rakesh
Gupta, Sajal
Henkel, Ralf
Agarwal, Ashok
ASIAN JOURNAL OF ANDROLOGY, 2019, 21 (05) : 438 - 444
[40] Formulation of Wavelet Based Multi-Scale Multi-Objective Performance Evaluation (WMMPE) Metric for Improved Calibration of Hydrological Models
Manikanta, Velpuri
Vema, Vamsi Krishna
WATER RESOURCES RESEARCH, 2022, 58 (07)

← 1 2 3 4 5 →