Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques

被引:0
|
作者
Drugman, Thomas [1 ]
Dubuisson, Thomas [1 ]
Moinet, Alexis [1 ]
D'Alessandro, Nicolas [1 ]
Dutoit, Thierry [1 ]
机构
[1] Fac Polytech Mons, TCTS Lab, B-7000 Mons, Belgium
关键词
speech processing; speech analysis; voice source; glottal formant;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared to two other state-of-the-art well-known methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. Technique robustness is tested by analyzing the influence of noise and Glottal Closure Instant (GCI) location errors. Besides impacts of the fundamental frequency and the first formant on the performance are evaluated. Our proposed approach shows significant improvement in robustness, which could be of a great interest when decomposing real speech.
引用
收藏
页码:202 / 207
页数:6
相关论文
共 50 条
  • [1] A comparative study of glottal source estimation techniques
    Drugman, Thomas
    Bozkurt, Baris
    Dutoit, Thierry
    [J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (01): : 20 - 34
  • [2] On the use of voice descriptors for glottal source shape parameter estimation
    Huber, Stefan
    Roebel, Axel
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (05): : 1170 - 1194
  • [3] GLOTTAL SOURCE ASYMMETRY ESTIMATION BY ICA
    Gomez-Vilda, Pedro
    Fernandez-Baillo, Roberto
    Rodellar-Biarge, Victoria
    Puntonet, Carlos G.
    [J]. BIOSIGNALS 2011, 2011, : 559 - +
  • [4] Estimation method of glottal vocal efficiency based on conversion function of voice source
    ZOU Yuan WAN Mingxi ZHAO Shouguo WANG Supin(1 Department of Biomedical Engineering
    [J]. Chinese Journal of Acoustics, 2002, (04) : 332 - 342
  • [5] Glottal Source Estimation Using an Automatic Chirp Decomposition
    Drugman, Thomas
    Bozkurt, Baris
    Dutoit, Thierry
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 35 - +
  • [6] GLOTTAL SOURCE MODELING FOR VOICE CONVERSION
    CHILDERS, DG
    [J]. SPEECH COMMUNICATION, 1995, 16 (02) : 127 - 138
  • [7] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Qiang
    Murphy, Peter
    Yan, Yong-Hong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (05): : 982 - 986
  • [8] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Q
    Murphy, P
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 492 - 501
  • [9] Glottal Source Estimation Based on Bivariate Empirical Mode Decomposition
    Kemiha, Mina
    Kacha, Abdellah
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 311 - +
  • [10] GLOTTAL SOURCE ESTIMATION USING A SUM-OF-EXPONENTIALS MODEL
    KRISHNAMURTHY, AK
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (03) : 682 - 686