The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement

被引:0
|
作者
de Oliveira, Danilo [1 ]
Welker, Simon [1 ]
Richter, Julius [1 ]
Gerkmann, Timo [1 ]
机构
[1] Univ Hamburg, Signal Proc, Hamburg, Germany
来源
关键词
speech enhancement evaluation; speech quality metrics;
D O I
10.21437/Interspeech.2024-2051
中图分类号
学科分类号
摘要
To obtain improved speech enhancement models, researchers often focus on increasing performance according to specific instrumental metrics. However, when the same metric is used in a loss function to optimize models, it may be detrimental to aspects that the given metric does not see. The goal of this paper is to illustrate the risk of overfitting a speech enhancement model to the metric used for evaluation. For this, we introduce enhancement models that exploit the widely used PESQ measure. Our "PESQetarian" model achieves 3.82 PESQ on VB-DMD while scoring very poorly in a listening experiment. While the obtained PESQ value of 3.82 would imply "state-of-the-art" PESQ-performance on the VB-DMD benchmark, our examples show that when optimizing w.r.t. a metric, an isolated evaluation on the same metric may be misleading. Instead, other metrics should be included in the evaluation and the resulting performance predictions should be confirmed by listening.
引用
收藏
页码:3854 / 3858
页数:5
相关论文
共 50 条
  • [31] Sacrificing commentary: Reading the end of literature - Goodhart,S
    Siebers, T
    PHILOSOPHY AND LITERATURE, 1997, 21 (02) : 487 - 489
  • [32] Benford's Law for Detecting Contrast Enhancement
    Moin, Syeda Shira
    Islam, Saiful
    2017 FOURTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2017, : 234 - 237
  • [33] Speech enhancement for bandlimited speech
    Heide, DA
    Kang, GS
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396
  • [34] DISCRETE S-TRANSFORM BASED SPEECH ENHANCEMENT
    Hu, Guo-hua
    Li, Rui
    Tao, Liang
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2015, 8 (04) : 2231 - 2246
  • [35] Sacrificing commentary: Reading the end of literature - Goodhart,S
    McKenna, AJ
    RELIGION & LITERATURE, 1997, 29 (02) : 87 - 92
  • [36] Zipf's law and tactical/mobile speech communications
    Yavuz, D
    IEEE/AFCEA EUROCOMM 2000, CONFERENCE RECORD: INFORMATION SYSTEMS FOR ENHANCED PUBLIC SAFETY AND SECURITY, 2000, : 283 - 287
  • [37] The Exercise of Jurisdiction and the Absent Author of Law's Speech
    Ridler, Victoria L.
    LAW & LITERATURE, 2019, 31 (01) : 71 - 93
  • [38] NOTE INTERNATIONAL MEGAN's LAW AS COMPELLED SPEECH
    Genord, Alexandra R.
    MICHIGAN LAW REVIEW, 2020, 118 (08) : 1603 - 1628
  • [39] Relevance and limits of Mott's law in disordered insulators.
    Ladieu, F
    Sanquer, M
    ANNALES DE PHYSIQUE, 1996, 21 (03) : 267 - 336
  • [40] Relevance of collagen piezoelectricity to "Wolff's Law": A critical review
    Ahn, Andrew C.
    Grodzinsky, Alan J.
    MEDICAL ENGINEERING & PHYSICS, 2009, 31 (07) : 733 - 741