Performance improvement of a non-intrusive voice quality metric in lossy networks

被引:7
|
作者
Nunes, Rodrigo Dantas [1 ]
Rosa, Renata Lopes [1 ]
Rodriguez, Demostenes Zegarra [1 ]
机构
[1] Univ Fed Lavras, Lavras, MG, Brazil
基金
巴西圣保罗研究基金会;
关键词
speech processing; mean square error methods; mobile handsets; correlation theory; lossy network; phone calls; mobile service providers; audio signal; speech signal; packet loss rate value; voice quality server; mobile device; P; 563 algorithm performance; nonintrusive voice quality metric assessment; ITU-T Rec; 563; algorithm; Pearson correlation coefficient; root mean square error; SPEECH;
D O I
10.1049/iet-com.2018.5165
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Voice quality assessment of phone calls is a relevant task for mobile service providers. In this context, the main objective of this research is to provide a model that improves the performance of ITU-T Rec. P.563. To accomplish this objective, the proposed model considers two aspects, better response in lossy network and adequate treatment of silences segments into the audio signal. Thus, a function is determined to suppress silences in the speech signal according to the packet loss rate value. Furthermore, the proposed model is implemented on both a voice quality server and a mobile device. Experimental results show that P.563 algorithm performance was really improved by the proposed model, approximating its results to those given by P.862 algorithm, reaching a Pearson correlation coefficient of 0.9957 and a root mean square error of 0.2983. Moreover, subjective test results demonstrated that the proposed model results overcome those obtained by the P.563 algorithm.
引用
收藏
页码:3401 / 3408
页数:8
相关论文
共 50 条
  • [41] DNSMOS P.835: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS
    Reddy, Chandan K. A.
    Gopal, Vishak
    Cutler, Ross
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 886 - 890
  • [42] NIML: non-intrusive machine learning-based speech quality prediction on VoIP networks
    Alkhawaldeh, Rami S.
    Khawaldeh, Saed
    Pervaiz, Usama
    Alawida, Moatsum
    Alkhawaldeh, Hamzah
    IET COMMUNICATIONS, 2019, 13 (16) : 2609 - 2616
  • [43] Non-Intrusive Appliance Identification with Appliance-Specific Networks
    Fang, Zhaoyuan
    Zhao, Dongbo
    Chen, Chen
    Li, Yang
    Tian, Yuting
    2019 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2019,
  • [44] INTRUSIVE AND NON-INTRUSIVE PERCEPTUAL SPEECH QUALITY ASSESSMENT USING A CONVOLUTIONAL NEURAL NETWORK
    Gamper, Hannes
    Reddy, Chandan K. A.
    Cutler, Ross
    Tashev, Ivan J.
    Gehrke, Johannes
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 85 - 89
  • [45] A Non-intrusive Failure Prediction Mechanism for Deployed Optical Networks
    Das, Dibakar
    Imteyaz, Mohammad Fahad
    Bapat, Jyotsna
    Das, Debabrata
    2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 24 - 28
  • [46] Non-intrusive GMM-based speech quality measurement
    Falk, TH
    Xu, QF
    Chan, WY
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 125 - 128
  • [47] Non-intrusive Speech Quality Assessment with Support Vector Regression
    Narwaria, Manish
    Lin, Weisi
    McLoughlin, Ian Vince
    Emmanuel, Sabu
    Tien, Chia Liang
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 325 - 335
  • [48] Enhanced perceptual model for non-intrusive speech quality assessment
    Kim, Doh-Suk
    Tarraf, Ahmed
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 829 - 832
  • [49] Non-Intrusive Harmonic Source Identification Using Neural Networks
    Janani, K.
    Himavathi, S.
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY, INFORMATION AND COMMUNICATION (ICCPEIC 2013), 2013, : 59 - 64
  • [50] Novel non-intrusive model of fullness perception of sound quality
    Lu, Jing
    Wan, Wanggen
    Yu, Xiaoqing
    Journal of Computational Information Systems, 2010, 6 (03): : 905 - 911