Performance improvement of a non-intrusive voice quality metric in lossy networks

被引:7
|
作者
Nunes, Rodrigo Dantas [1 ]
Rosa, Renata Lopes [1 ]
Rodriguez, Demostenes Zegarra [1 ]
机构
[1] Univ Fed Lavras, Lavras, MG, Brazil
基金
巴西圣保罗研究基金会;
关键词
speech processing; mean square error methods; mobile handsets; correlation theory; lossy network; phone calls; mobile service providers; audio signal; speech signal; packet loss rate value; voice quality server; mobile device; P; 563 algorithm performance; nonintrusive voice quality metric assessment; ITU-T Rec; 563; algorithm; Pearson correlation coefficient; root mean square error; SPEECH;
D O I
10.1049/iet-com.2018.5165
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Voice quality assessment of phone calls is a relevant task for mobile service providers. In this context, the main objective of this research is to provide a model that improves the performance of ITU-T Rec. P.563. To accomplish this objective, the proposed model considers two aspects, better response in lossy network and adequate treatment of silences segments into the audio signal. Thus, a function is determined to suppress silences in the speech signal according to the packet loss rate value. Furthermore, the proposed model is implemented on both a voice quality server and a mobile device. Experimental results show that P.563 algorithm performance was really improved by the proposed model, approximating its results to those given by P.862 algorithm, reaching a Pearson correlation coefficient of 0.9957 and a root mean square error of 0.2983. Moreover, subjective test results demonstrated that the proposed model results overcome those obtained by the P.563 algorithm.
引用
收藏
页码:3401 / 3408
页数:8
相关论文
共 50 条
  • [31] Non-Intrusive Technique for Pathological Voice Classification using Jitter And Shimmer
    Sripriya, N.
    Poornima, S.
    Shivaranjani, R.
    Thangaraju, Preethi
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 240 - 245
  • [32] Non-intrusive speech quality assessment using context-aware neural networks
    Jaiswal R.K.
    Dubey R.K.
    International Journal of Speech Technology, 2022, 25 (04) : 947 - 965
  • [33] Perceptual model for non-intrusive speech quality assessment
    Kim, DS
    Tarraf, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 1060 - 1063
  • [34] Discrete Choice Models for Non-Intrusive Quality Assessment
    Petkov, Petko N.
    Kleijn, W. Bastiaan
    de Vries, Bert
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 200 - +
  • [35] A Bayesian Approach to Non-Intrusive Quality Assessment of Speech
    Petkov, Petko N.
    Mossavat, Iman S.
    Kleijn, W. Bastiaan
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2839 - +
  • [36] Non-intrusive Diagnostic Monitoring of Fullband Speech Quality
    Moeller, Sebastian
    Huebschen, Tobias
    Michael, Thilo
    Mittag, Gabriel
    Schmidt, Gerhard
    INTERSPEECH 2020, 2020, : 2872 - 2876
  • [37] Non-intrusive quality monitoring method of VoIP speech based on network performance metrics
    Masuda, M
    Hayashi, T
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2006, E89B (02) : 304 - 312
  • [38] Non-intrusive quantification of performance and its relationship to mood
    Davide Carneiro
    André Pimenta
    José Neves
    Paulo Novais
    Soft Computing, 2017, 21 : 4917 - 4923
  • [39] Non-intrusive quantification of performance and its relationship to mood
    Carneiro, Davide
    Pimenta, Andre
    Neves, Jose
    Novais, Paulo
    SOFT COMPUTING, 2017, 21 (17) : 4917 - 4923
  • [40] High performance non-intrusive distributed CORBA monitoring
    Vermeulen, B
    De Reu, D
    Dhoedt, B
    Demeester, P
    2002 IEEE WORKSHOP ON IP OPERATIONS AND MANAGEMENT, 2002, : 116 - 120