Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques

Cited by: 0

Authors:

Drugman, Thomas [1]

Dubuisson, Thomas [1]

Moinet, Alexis [1]

D'Alessandro, Nicolas [1]

Dutoit, Thierry [1]

Affiliation:

[1] Fac Polytech Mons, TCTS Lab, B-7000 Mons, Belgium

Source:

SIGMAP 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS | 2008

Keywords:

speech processing; speech analysis; voice source; glottal formant

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Subject classification code:

0808; 0809

Abstract:

This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared to two well-known state-of-the-art methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. The robustness of each technique is tested by analyzing the influence of noise and of Glottal Closure Instant (GCI) location errors. In addition, the impacts of the fundamental frequency and the first formant on performance are evaluated. The proposed approach shows a significant improvement in robustness, which could be of great interest when decomposing real speech.


Pages: 202 - 207

Page count: 6


[31] Analysis and Detection of Pathological Voice Using Glottal Source Features
Kadiri, Sudarsana Reddy
Alku, Paavo
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 367 - 379
[32] Assessment of disordered voice based on an optimized glottal source model
Boudjerda, Mounir
Kacha, Abdellah
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (04) : 3201 - 3214
[33] Glottal source estimation from coded telephone speech using a deep neural network
Narendra, N. P.
Airaksinen, Manu
Alku, Paavo
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3931 - 3935
[34] Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation
Drugman, Thomas
Bozkurt, Baris
Dutoit, Thierry
[J]. SPEECH COMMUNICATION, 2011, 53 (06) : 855 - 866
[35] Estimation of the glottal source from coded telephone speech using deep neural networks
Narendra, N. P.
Airaksinen, Manu
Story, Brad
Alku, Paavo
[J]. SPEECH COMMUNICATION, 2019, 106 : 95 - 104
[36] Estimation of coherent source bearings using variational techniques
V. I. Turchin
G. E. Fiks
I. Sh. Fiks
[J]. Radiophysics and Quantum Electronics, 2007, 50 : 244 - 251
[37] ESTIMATION OF COHERENT SOURCE BEARINGS USING VARIATIONAL TECHNIQUES
Turchin, V. I.
Fiks, G. E.
Fiks, I. Sh.
[J]. RADIOPHYSICS AND QUANTUM ELECTRONICS, 2007, 50 (03) : 244 - 251
[38] Comparison of Blind Source Separation Techniques for Respiration Rate Estimation from Depth Video
Mozafari, Mohsen
Law, Andrew
Djouaka, Samuel Beni Tchoudem
Green, James R.
Goubran, Rafik A.
[J]. 2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
[39] A comparative study of glottal open quotient estimation techniques
Kane, John
Scherer, Stefan
Morency, Louis-Philippe
Gobl, Christer
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1657 - 1661
[40] NOISE ROBUST ESTIMATION OF THE VOICE SOURCE USING A DEEP NEURAL NETWORK
Airaksinen, Manu
Raitio, Tuomo
Alku, Paavo
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5137 - 5141
