DNSMOS: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS

被引:114
|
作者
Reddy, Chandan K. A. [1 ]
Gopal, Vishak [1 ]
Cutler, Ross [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
关键词
Speech; Perceptual Speech Quality; Objective Metric; Deep Noise Suppressor; Metric; NETWORKS;
D O I
10.1109/ICASSP39728.2021.9414878
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. Previous no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise suppression algorithms. This paper introduces a multi-stage self-teaching based perceptual objective metric that is designed to evaluate noise suppressors. The proposed method generalizes well in challenging test conditions with a high correlation to human ratings.
引用
收藏
页码:6493 / 6497
页数:5
相关论文
共 50 条
  • [1] DNSMOS P.835: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS
    Reddy, Chandan K. A.
    Gopal, Vishak
    Cutler, Ross
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 886 - 890
  • [2] Perceptual model for non-intrusive speech quality assessment
    Kim, DS
    Tarraf, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 1060 - 1063
  • [3] NON-INTRUSIVE SPEECH QUALITY OBJECTIVE EVALUATION IN HIGH-NOISE ENVIRONMENTS
    Zhou, Weili
    He, Qianhua
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 50 - 54
  • [4] Enhanced perceptual model for non-intrusive speech quality assessment
    Kim, Doh-Suk
    Tarraf, Ahmed
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 829 - 832
  • [5] Objective Speech Quality Assessment with Non-intrusive Method for Narrowband Speech
    Wang, Jing
    Luo, Juan
    Zhao, Shenghui
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 518 - 521
  • [6] Non-intrusive Objective Evaluation of Speech Quality in Noisy Condition
    Islam, Md. Rafidul
    Rahman, Md. Ashequr
    Hasan, Md. Numan
    Hossain, A. N. M. Shahriyar
    Uddin, Ahmed Nazim
    Haque, Mohammad Ariful
    2016 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2016, : 586 - 589
  • [7] INTRUSIVE AND NON-INTRUSIVE PERCEPTUAL SPEECH QUALITY ASSESSMENT USING A CONVOLUTIONAL NEURAL NETWORK
    Gamper, Hannes
    Reddy, Chandan K. A.
    Cutler, Ross
    Tashev, Ivan J.
    Gehrke, Johannes
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 85 - 89
  • [8] A Novel Non-intrusive Objective Speech Quality Measurement based on GMM and SVR
    Wang, Jing
    Luo, Juan
    Zhao, Shenghui
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 691 - 694
  • [9] Non-intrusive Objective Speech Quality Measurement based on GMM and SVR for Narrowband and Wideband Speech
    Wang, Jing
    Luo, Juan
    Zhao, Shenghui
    Kuang, Jingming
    2008 11TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), VOLS 1-3, 2008, : 193 - 198
  • [10] Perceptual non-intrusive speech quality assessment using a self-organizing map
    Mahdi, Abdulhussain E.
    JOURNAL OF ENTERPRISE INFORMATION MANAGEMENT, 2006, 19 (02) : 148 - +