On the Assessment of High-Quality Voice Recordings including Voice Postprocessing

被引:2
|
作者
Beerends, John G. [1 ]
Beerends, Imre [2 ]
机构
[1] TNO, NL-2509 JE The Hague, Netherlands
[2] Mantis Audio, Wateringen, Netherlands
来源
关键词
ITU-T STANDARD; ASSESSMENT POLQA;
D O I
10.17743/jaes.2015.0013
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
When we assess the quality of a voice recording two different aspects play a role the voice characteristics (voice quality) and the audio chain characteristics (audio quality). Subjective experiments where no clear ideal reference is provided, so called absolute category rating experiments, assess the speech quality, i.e., the combined effect of voice and audio quality. This paper investigates whether voice postprocessing such as timbre optimization, loudness optimization; de-essing, room reverberation optimization, and (background) noise suppression can improve the quality of a high quality voice recording. It turned out that none of the processing provides a significant improvement in perceived quality. The best postprocessing is noise reduction to absolute silence, delivering only a non-significant improvement when the voice recording is of high quality. The subjective quality evaluations show a significant preference of male over female voice and a significant effect of speaker/sentence dependency on the perceived quality of certain types of degradation. The subjective results are compared with predictions made with the ITU-T standard for the objective assessment of speech quality POLQA (ITU-T Recommendation P.863 versions 1.1 and 2.4) and shows that many speech quality effects are predicted correctly, on condition level as well as individual sentence level.
引用
收藏
页码:174 / 183
页数:10
相关论文
共 50 条
  • [41] Techniques for Obtaining High-quality Recordings in Electrocochleography
    Simpson, Michael J.
    Jennings, Skyler G.
    Margolis, Robert H.
    FRONTIERS IN SYSTEMS NEUROSCIENCE, 2020, 14
  • [42] The Acoustic Voice Quality Index: Toward improved treatment outcomes assessment in voice disorders
    Maryn, Youri
    De Bodt, Marc
    Roy, Nelson
    JOURNAL OF COMMUNICATION DISORDERS, 2010, 43 (03) : 161 - 174
  • [43] VOICE ANONYMIZATION IN URBAN SOUND RECORDINGS
    Cohen-Hadria, Alice
    Cartwright, Mark
    McFee, Brian
    Bello, Juan Pablo
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [44] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
    CHEN Xian-tong
    ZHANG Ling-hua
    The Journal of China Universities of Posts and Telecommunications, 2014, (05) : 68 - 75
  • [45] Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
    Wang, Yu
    Wang, Xinsheng
    Zhu, Pengcheng
    Wu, Jie
    Li, Hanzhao
    Xue, Heyang
    Zhang, Yongmao
    Xie, Lei
    Bi, Mengxiao
    INTERSPEECH 2022, 2022, : 4242 - 4246
  • [46] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
    CHEN Xian-tong
    ZHANG Ling-hua
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2014, 21 (05) : 68 - 75+93
  • [47] Clinical Usefulness of Voice Recordings using a Smartphone as a Screening Tool for Voice Disorders
    Lee, Seung Jin
    Lee, Kwang Yong
    Choi, Hong-Shik
    COMMUNICATION SCIENCES AND DISORDERS-CSD, 2018, 23 (04): : 1065 - 1077
  • [48] Venezuelan voice database for voice quality testing
    Jimenez, Jesus J. G.
    Diaz, Jose. A.
    Pacheco, Jose
    INGENIERIA UC, 2013, 20 (01): : 17 - 24
  • [49] On voice quality of IP voice over GPRS
    Lakaniemi, A
    Parantainen, J
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 751 - 754
  • [50] The acceptance and voice quality of a new voice prosthesis Vega High performance' - a feasibility study
    Heirman, Anne N.
    Tellman, Roosmarijn Sophie
    van der Molen, Lisette
    van Son, Rob
    van Sluis, Klaske
    Halmos, Gyorgy Bela
    van den Brekel, Michiel Wilhemus Maria
    Dirven, Richard
    ACTA OTO-LARYNGOLOGICA, 2023, 143 (08) : 721 - 729