DNSMOS: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS

被引:114
|
作者
Reddy, Chandan K. A. [1 ]
Gopal, Vishak [1 ]
Cutler, Ross [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
关键词
Speech; Perceptual Speech Quality; Objective Metric; Deep Noise Suppressor; Metric; NETWORKS;
D O I
10.1109/ICASSP39728.2021.9414878
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. Previous no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise suppression algorithms. This paper introduces a multi-stage self-teaching based perceptual objective metric that is designed to evaluate noise suppressors. The proposed method generalizes well in challenging test conditions with a high correlation to human ratings.
引用
收藏
页码:6493 / 6497
页数:5
相关论文
共 50 条
  • [21] Effectiveness of Ideal Ratio Mask for Non-intrusive Quality Assessment of Noise Suppressed Speech
    Soni, Meet H.
    Patil, Hemant A.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 573 - 577
  • [22] Specialized Speech Enhancement Model Selection Based on Learned Non-intrusive Quality Assessment Metric
    Zezario, Ryandhimas E.
    Fu, Szu-Wei
    Lu, Xugang
    Wang, Hsin-Min
    Tsao, Yu
    INTERSPEECH 2019, 2019, : 3168 - 3172
  • [23] Novel Subband Autoencoder Features for Non-intrusive Quality Assessment of Noise Suppressed Speech
    Soni, Meet H.
    Patil, Hemant A.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3708 - 3712
  • [24] Transformer Networks for Non-Intrusive Speech Quality Prediction
    Jayesh, M. K.
    Sharma, Mukesh
    Vonteddu, Praneeth
    Shaik, M. A. B.
    Ganapathy, Sriram
    INTERSPEECH 2022, 2022, : 4078 - 4082
  • [25] A Bayesian Approach to Non-Intrusive Quality Assessment of Speech
    Petkov, Petko N.
    Mossavat, Iman S.
    Kleijn, W. Bastiaan
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2839 - +
  • [26] Non-intrusive Diagnostic Monitoring of Fullband Speech Quality
    Moeller, Sebastian
    Huebschen, Tobias
    Michael, Thilo
    Mittag, Gabriel
    Schmidt, Gerhard
    INTERSPEECH 2020, 2020, : 2872 - 2876
  • [27] Non-intrusive Objective Speech Quality Assessment using a Combination of MFCC, PLP and LSF Features
    Dubey, Rajesh Kumar
    Kumar, Arun
    2013 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSC), 2013, : 297 - 302
  • [28] NON-INTRUSIVE OBJECTIVE SPEECH QUALITY AND INTELLIGIBILITY PREDICTION FOR HEARING INSTRUMENTS IN COMPLEX LISTENING ENVIRONMENTS
    Falk, Tiago H.
    Cosentino, Stefano
    Santos, Joao
    Suelzle, David
    Parsa, Vijay
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7820 - 7824
  • [29] Non-intrusive assessment of speech quality for classical Chinese poetry recitals using perceptual and acoustic features
    Liu, Ganjun
    Zhang, Tao
    Guo, Haoyang
    Wen, Junhuan
    Lv, Ying
    APPLIED ACOUSTICS, 2024, 216
  • [30] PERFORMANCE COMPARISON OF INTRUSIVE AND NON-INTRUSIVE INSTRUMENTAL QUALITY MEASURES FOR ENHANCED SPEECH
    Avila, Anderson
    Cauchi, Benjamin
    Goetze, Stefan
    Doclo, Simon
    Falk, Tiago
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,