DNSMOS: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS

被引:114
|
作者
Reddy, Chandan K. A. [1 ]
Gopal, Vishak [1 ]
Cutler, Ross [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
关键词
Speech; Perceptual Speech Quality; Objective Metric; Deep Noise Suppressor; Metric; NETWORKS;
D O I
10.1109/ICASSP39728.2021.9414878
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. Previous no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise suppression algorithms. This paper introduces a multi-stage self-teaching based perceptual objective metric that is designed to evaluate noise suppressors. The proposed method generalizes well in challenging test conditions with a high correlation to human ratings.
引用
收藏
页码:6493 / 6497
页数:5
相关论文
共 50 条
  • [41] Predicting score distribution to improve non-intrusive speech quality estimation
    Faridee, Abu Zaher Md
    Gamper, Hannes
    INTERSPEECH 2022, 2022, : 406 - 410
  • [42] Enhanced non-intrusive speech quality measurement using degradation models
    Falk, Tiago H.
    Chan, Wai-Yip
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 837 - 840
  • [43] ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
    Yi, Gaoxiong
    Xiao, Wei
    Xiao, Yiming
    Naderi, Babak
    Moller, Sebastian
    Wardah, Wafaa
    Mittag, Gabriel
    Cutler, Ross
    Zhang, Zhuohuang
    Williamson, Donald S.
    Chen, Fei
    Yang, Fuzheng
    Shang, Shidong
    INTERSPEECH 2022, 2022, : 3308 - 3312
  • [44] A data-driven non-intrusive measure of speech quality and intelligibility
    Sharma, Dushyant
    Wang, Yu
    Naylor, Patrick A.
    Brookes, Mike
    SPEECH COMMUNICATION, 2016, 80 : 84 - 94
  • [45] Novel Deep Autoencoder Features for Non-intrusive Speech Quality Assessment
    Soni, Meet H.
    Patil, Hemant A.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2315 - 2319
  • [46] Coded Speech Quality Measurement by a Non-Intrusive PESQ-DNN
    Xu Z.
    Zhao Z.
    Fingscheidt T.
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2023, 31 : 3404 - 3417
  • [47] A CLASSIFICATION-AIDED FRAMEWORK FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT
    Dong, Xuan
    Williamson, Donald S.
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 100 - 104
  • [48] Comparing neural network architectures for non-intrusive speech quality prediction
    Schill, Leif Forland
    Piechowiak, Tobias
    Laroche, Clement
    Mowlaee, Pejman
    SPEECH COMMUNICATION, 2024, 165
  • [49] Non-intrusive single-ended speech quality assessment in VoIP
    Ding, Lijing
    Lin, Zhong
    Radwan, Ayman
    El-Hennawey, Mohamed Samy
    Goubran, Rafik A.
    SPEECH COMMUNICATION, 2007, 49 (06) : 477 - 489
  • [50] New Non-intrusive Speech Quality Assessment Algorithm for Wireless Networks
    Akmalkhodzhaev, Akmal
    Kozlov, Alexander
    Intelligent Interactive Multimedia Systems and Services, 2015, 40 : 215 - 225