The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement

Cited by: 0
Authors
de Oliveira, Danilo [1]
Welker, Simon [1]
Richter, Julius [1]
Gerkmann, Timo [1]
Affiliations
[1] Univ Hamburg, Signal Proc, Hamburg, Germany
Source
INTERSPEECH 2024 | 2024
Keywords
speech enhancement evaluation; speech quality metrics
DOI
10.21437/Interspeech.2024-2051
Abstract
To obtain improved speech enhancement models, researchers often focus on increasing performance according to specific instrumental metrics. However, when the same metric is used in a loss function to optimize models, it may be detrimental to aspects that the given metric does not see. The goal of this paper is to illustrate the risk of overfitting a speech enhancement model to the metric used for evaluation. For this, we introduce enhancement models that exploit the widely used PESQ measure. Our "PESQetarian" model achieves 3.82 PESQ on VB-DMD while scoring very poorly in a listening experiment. While the obtained PESQ value of 3.82 would imply "state-of-the-art" PESQ-performance on the VB-DMD benchmark, our examples show that when optimizing w.r.t. a metric, an isolated evaluation on the same metric may be misleading. Instead, other metrics should be included in the evaluation and the resulting performance predictions should be confirmed by listening.
Pages: 3854-3858
Page count: 5