Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques

被引:0
|
作者
Drugman, Thomas [1 ]
Dubuisson, Thomas [1 ]
Moinet, Alexis [1 ]
D'Alessandro, Nicolas [1 ]
Dutoit, Thierry [1 ]
机构
[1] Fac Polytech Mons, TCTS Lab, B-7000 Mons, Belgium
关键词
speech processing; speech analysis; voice source; glottal formant;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared to two other state-of-the-art well-known methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. Technique robustness is tested by analyzing the influence of noise and Glottal Closure Instant (GCI) location errors. Besides impacts of the fundamental frequency and the first formant on the performance are evaluated. Our proposed approach shows significant improvement in robustness, which could be of a great interest when decomposing real speech.
引用
收藏
页码:202 / 207
页数:6
相关论文
共 50 条
  • [41] Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range
    Takahashi, Kyoko
    Akagi, Masato
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1879 - 1887
  • [42] Fast and robust joint estimation of vocal tract and voice source parameters
    Ding, W
    Campbell, N
    Higuchi, N
    Kasuya, H
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1291 - 1294
  • [43] NOISE ROBUST ESTIMATION OF THE VOICE SOURCE USING A DEEP NEURAL NETWORK
    Airaksinen, Manu
    Raitio, Tuomo
    Alku, Paavo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5137 - 5141
  • [44] Adding Glottal Source Information to Intra-lingual Voice Conversion
    Perez, Javier
    Bonafonte, Antonio
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2784 - +
  • [45] Voice source analysis using biomechanical modeling and glottal inverse filtering
    Pinheiro, Alan P.
    Raitio, Tuomo
    Gomes, Danyane S.
    Alku, Paavo
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1602 - 1605
  • [46] A comparative evaluation of the Zeros of Z Transform representation for voice source estimation
    Sturmel, Nicolas
    d'Alessandro, Christophe
    Doval, Boris
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 777 - 780
  • [47] A comparison of glottal voice source quantification parameters in breathy, normal and pressed phonation of female and male speakers
    Alku, P
    Vilkman, E
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 1996, 48 (05) : 240 - 254
  • [48] Comparison of spectral and subspace algorithms for FM source estimation
    Ejaz, S.
    Shafiq, M.A.
    [J]. Progress In Electromagnetics Research C, 2010, 14 : 11 - 21
  • [49] ASSESSING SENSITIVITY OF OBSERVATIONS IN SOURCE TERM ESTIMATION FOR NUCLEAR ACCIDENTS
    Ma, Yuanwei
    Wang, Dezhong
    Tan, Wenji
    Ji, Zhilong
    Zhang, Kuo
    [J]. PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON NUCLEAR ENGINEERING AND THE ASME 2012 POWER CONFERENCE - 2012, VOL 2, 2012, : 377 - 382
  • [50] Methods of sensitivity theory and inverse modeling for estimation of source parameters
    Penenko, V
    Baklanov, A
    Tsvetova, E
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2002, 18 (05): : 661 - 671