Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques

Cited by: 0
Authors
Drugman, Thomas [1]
Dubuisson, Thomas [1]
Moinet, Alexis [1]
D'Alessandro, Nicolas [1]
Dutoit, Thierry [1]
Affiliation
[1] Fac Polytech Mons, TCTS Lab, B-7000 Mons, Belgium
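Keywords
speech processing; speech analysis; voice source; glottal formant
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract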
This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared with two well-known state-of-the-art methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. Robustness is tested by analyzing the influence of noise and of Glottal Closure Instant (GCI) location errors. In addition, the impact of the fundamental frequency and of the first formant on performance is evaluated. The proposed approach shows a significant improvement in robustness, which could be of great interest when decomposing real speech.
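The abstract names spectral distortion under additive noise as one of the objective measures but does not give its formula. The following is only a minimal sketch, assuming the common RMS log-spectral distance in dB; the function names (`spectral_distortion_db`, `add_noise`), the FFT size, and the windowed sinusoid used as a stand-in for a glottal pulse are all hypothetical and are not taken from the paper.

```python
import numpy as np

def spectral_distortion_db(reference, estimate, n_fft=1024, eps=1e-12):
    """RMS log-spectral distance in dB (assumed definition, not the paper's)."""
    ref_mag = np.abs(np.fft.rfft(reference, n_fft)) + eps
    est_mag = np.abs(np.fft.rfft(estimate, n_fft)) + eps
    diff_db = 20.0 * np.log10(ref_mag / est_mag)
    return float(np.sqrt(np.mean(diff_db ** 2)))

def add_noise(signal, snr_db, rng):
    """Add white Gaussian noise at the requested signal-to-noise ratio (dB)."""
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    return signal + rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)

rng = np.random.default_rng(0)
t = np.arange(400) / 16000.0
# Stand-in waveform for illustration only; it is not a glottal flow model.
reference = np.sin(2 * np.pi * 200.0 * t) * np.hanning(400)

# Distortion of the noisy signal relative to the clean one at several SNRs.
for snr_db in (60, 40, 20):
    noisy = add_noise(reference, snr_db, rng)
    print(snr_db, spectral_distortion_db(reference, noisy))
```

In a robustness study of the kind described, `estimate` would be the source returned by a decomposition method run on the noisy synthetic speech, rather than the noisy reference itself as in this toy loop.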
Pages: 202-207
Number of pages: 6