On the Assessment of High-Quality Voice Recordings including Voice Postprocessing

Cited by: 2

Authors:

Beerends, John G. [1]

Beerends, Imre [2]

Affiliations:

[1] TNO, NL-2509 JE The Hague, Netherlands

[2] Mantis Audio, Wateringen, Netherlands

Source:

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2015, Vol. 63, No. 3

Keywords:

ITU-T STANDARD; ASSESSMENT POLQA;

DOI:

10.17743/jaes.2015.0013

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

When we assess the quality of a voice recording, two different aspects play a role: the voice characteristics (voice quality) and the audio chain characteristics (audio quality). Subjective experiments in which no clear ideal reference is provided, so-called absolute category rating experiments, assess the speech quality, i.e., the combined effect of voice and audio quality. This paper investigates whether voice postprocessing such as timbre optimization, loudness optimization, de-essing, room reverberation optimization, and (background) noise suppression can improve the quality of a high-quality voice recording. It turned out that none of the processing provides a significant improvement in perceived quality. The best postprocessing is noise reduction to absolute silence, which delivers only a non-significant improvement when the voice recording is of high quality. The subjective quality evaluations show a significant preference for male over female voices and a significant effect of speaker/sentence dependency on the perceived quality of certain types of degradation. The subjective results are compared with predictions made with POLQA, the ITU-T standard for the objective assessment of speech quality (ITU-T Recommendation P.863 versions 1.1 and 2.4); the comparison shows that many speech quality effects are predicted correctly, at the condition level as well as the individual sentence level.

Pages: 174 - 183

Page count: 10

50 records in total

[41] Techniques for Obtaining High-quality Recordings in Electrocochleography
Simpson, Michael J.
Jennings, Skyler G.
Margolis, Robert H.
FRONTIERS IN SYSTEMS NEUROSCIENCE, 2020, 14
[42] The Acoustic Voice Quality Index: Toward improved treatment outcomes assessment in voice disorders
Maryn, Youri
De Bodt, Marc
Roy, Nelson
JOURNAL OF COMMUNICATION DISORDERS, 2010, 43 (03) : 161 - 174
[43] VOICE ANONYMIZATION IN URBAN SOUND RECORDINGS
Cohen-Hadria, Alice
Cartwright, Mark
McFee, Brian
Bello, Juan Pablo
2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019
[44] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
CHEN Xian-tong
ZHANG Ling-hua
The Journal of China Universities of Posts and Telecommunications, 2014, (05) : 68 - 75
[45] Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Wang, Yu
Wang, Xinsheng
Zhu, Pengcheng
Wu, Jie
Li, Hanzhao
Xue, Heyang
Zhang, Yongmao
Xie, Lei
Bi, Mengxiao
INTERSPEECH 2022, 2022, : 4242 - 4246
[46] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
CHEN Xian-tong
ZHANG Ling-hua
The Journal of China Universities of Posts and Telecommunications, 2014, 21 (05) : 68 - 75+93
[47] Clinical Usefulness of Voice Recordings using a Smartphone as a Screening Tool for Voice Disorders
Lee, Seung Jin
Lee, Kwang Yong
Choi, Hong-Shik
COMMUNICATION SCIENCES AND DISORDERS-CSD, 2018, 23 (04) : 1065 - 1077
[48] Venezuelan voice database for voice quality testing
Jimenez, Jesus J. G.
Diaz, Jose. A.
Pacheco, Jose
INGENIERIA UC, 2013, 20 (01) : 17 - 24
[49] On voice quality of IP voice over GPRS
Lakaniemi, A
Parantainen, J
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 751 - 754
[50] The acceptance and voice quality of a new voice prosthesis 'Vega High performance' - a feasibility study
Heirman, Anne N.
Tellman, Roosmarijn Sophie
van der Molen, Lisette
van Son, Rob
van Sluis, Klaske
Halmos, Gyorgy Bela
van den Brekel, Michiel Wilhemus Maria
Dirven, Richard
ACTA OTO-LARYNGOLOGICA, 2023, 143 (08) : 721 - 729