When best is the enemy of good - critical evaluation of performance criteria in hydrological models

被引:9
|
作者
Cinkus, Guillaume [1 ]
Mazzilli, Naomi [2 ]
Jourde, Herve [1 ]
Wunsch, Andreas [3 ]
Liesch, Tanja [3 ]
Ravbar, Natasa [4 ]
Chen, Zhao [5 ]
Goldscheider, Nico [3 ]
机构
[1] Univ Montpellier, Hydrosci Montpellier HSM, CNRS, IRD, F-34090 Montpellier, France
[2] Univ Avignon, UMR EMMAH AU INRAE 1114, F-84000 Avignon, France
[3] Karlsruhe Inst Technol KIT, Inst Appl Geosci, Kaiserstr 12, D-76131 Karlsruhe, Germany
[4] ZRC SAZU, Karst Res Inst, Titov Trg 2, Postojna 6230, Slovenia
[5] Tech Univ Dresden, Inst Groundwater Management, D-01062 Dresden, Germany
关键词
CALIBRATION; SENSITIVITY; EFFICIENCY; MULTIPLE; METRICS; ERROR;
D O I
10.5194/hess-27-2397-2023
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Performance criteria play a key role in the calibration and evaluation of hydrological models and have been extensively developed and studied, but some of the most used criteria still have unknown pitfalls. This study set out to examine counterbalancing errors, which are inherent to the Kling-Gupta efficiency (KGE) and its variants. A total of nine performance criteria - including the KGE and its variants, as well as the Nash-Sutcliffe efficiency (NSE) and the modified index of agreement (d(1)) - were analysed using synthetic time series and a real case study. Results showed that, when assessing a simulation, the score of the KGE and some of its variants can be increased by concurrent overestimation and underestimation of discharge. These counterbalancing errors may favour bias and variability parameters, therefore preserving an overall high score of the performance criteria. As bias and variability parameters generally account for two-thirds of the weight in the equation of performance criteria such as the KGE, this can lead to an overall higher criterion score without being associated with an increase in model relevance. We recommend using (i) performance criteria that are not or less prone to counterbalancing errors (d(1), modified KGE, non-parametric KGE, diagnostic efficiency) and/or (ii) scaling factors in the equation to reduce the influence of relative parameters.
引用
下载
收藏
页码:2397 / 2411
页数:15
相关论文
共 50 条
  • [31] When Does It Pay to be Good? Moderators and Mediators in the Corporate Sustainability-Corporate Financial Performance Relationship: A Critical Review
    Grewatsch, Sylvia
    Kleindienst, Ingo
    JOURNAL OF BUSINESS ETHICS, 2017, 145 (02) : 383 - 416
  • [32] Performance evaluation of FAO Penman-Monteith and best alternative models for estimating reference evapotranspiration in Bangladesh
    Islam, Shakibul
    Alam, A. K. M. Rashidul
    HELIYON, 2021, 7 (07)
  • [33] River flow modelling: comparison of performance and evaluation of uncertainty using data-driven models and conceptual hydrological model
    Zhang, Zhenghao
    Zhang, Qiang
    Singh, Vijay P.
    Shi, Peijun
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2018, 32 (09) : 2667 - 2682
  • [34] River flow modelling: comparison of performance and evaluation of uncertainty using data-driven models and conceptual hydrological model
    Zhenghao Zhang
    Qiang Zhang
    Vijay P. Singh
    Peijun Shi
    Stochastic Environmental Research and Risk Assessment, 2018, 32 : 2667 - 2682
  • [35] A time series tool to support the multi-criteria performance evaluation of rainfall-runoff models
    Willems, Patrick
    ENVIRONMENTAL MODELLING & SOFTWARE, 2009, 24 (03) : 311 - 321
  • [36] Evaluation of polygenic risk models using multiple performance measures: a critical assessment of discordant results
    Martens, Forike K.
    Tonk, Elisa C. M.
    Janssens, A. Cecile J. W.
    GENETICS IN MEDICINE, 2019, 21 (02) : 391 - 397
  • [37] Flood spatial coherence, triggers, and performance in hydrological simulations: large-sample evaluation of four streamflow-calibrated models
    Brunner, Manuela, I
    Melsen, Lieke A.
    Wood, Andrew W.
    Rakovec, Oldrich
    Mizukami, Naoki
    Knoben, Wouter J. M.
    Clark, Martyn P.
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2021, 25 (01) : 105 - 119
  • [38] Comparative evaluation and performance of large language models on expert level critical care questions: a benchmark study
    Jessica D. Workum
    Bas W. S. Volkers
    Davy van de Sande
    Sumesh Arora
    Marco Goeijenbier
    Diederik Gommers
    Michel E. van Genderen
    Critical Care, 29 (1):
  • [39] Critical evaluation of two models of flow cytometers for the assessment of sperm DNA fragmentation: an appeal for performance verification
    Sharma, Rakesh
    Gupta, Sajal
    Henkel, Ralf
    Agarwal, Ashok
    ASIAN JOURNAL OF ANDROLOGY, 2019, 21 (05) : 438 - 444
  • [40] Formulation of Wavelet Based Multi-Scale Multi-Objective Performance Evaluation (WMMPE) Metric for Improved Calibration of Hydrological Models
    Manikanta, Velpuri
    Vema, Vamsi Krishna
    WATER RESOURCES RESEARCH, 2022, 58 (07)