Robust glottal source estimation based on joint source-filter model optimization

Cited by: 51
Authors
Fu, Q [1]
Murphy, P [1]
Affiliations
[1] Univ Limerick, Dept Elect & Comp Engn, Limerick, Ireland
Keywords
convex optimization; glottal inverse filtering; source-filter joint optimization; source-filter separation; time-varying vocal tract filter
DOI
10.1109/TSA.2005.857807
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper describes a robust glottal source estimation method based on a joint source-filter separation technique. In this method, the Liljencrants-Fant (LF) model, which models the glottal flow derivative, is integrated into a time-varying ARX speech production model. These two models are estimated in a joint optimization procedure, in which a Kalman filtering process is embedded for adaptively identifying the vocal tract parameters. Since the formulated joint estimation problem is a multiparameter nonlinear optimization procedure, we separate the optimization procedure into two passes. The first pass initializes the glottal source and vocal tract models by solving a quasi-convex approximate optimization problem. Having robust initial values, the joint estimation procedure determines the accuracy of model estimation implemented with a trust-region descent optimization algorithm. Experiments with synthetic and real voice signals show that the proposed method is a robust glottal source parameter estimation method with a high degree of accuracy.
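The adaptive vocal tract identification mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the excitation `u` is already known (the paper estimates the LF source jointly), it models the ARX coefficients as a random walk, and the function names, the model order, and the noise variances `q` and `r` are hypothetical choices made only for illustration.

```python
import numpy as np

def synthesize_arx(a, u):
    """Generate s[n] = -sum_k a[k-1] * s[n-k] + u[n] (assumed ARX form, fixed coefficients)."""
    s = np.zeros(len(u))
    for n in range(len(u)):
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                s[n] -= ak * s[n - k]
        s[n] += u[n]
    return s

def kalman_arx_track(s, u, order=2, q=1e-6, r=1e-4):
    """Track ARX coefficients sample by sample with a random-walk Kalman filter.

    State: theta[n] = theta[n-1] + w[n], with w ~ N(0, q*I)   (assumption)
    Observation: s[n] - u[n] = h[n] . theta[n] + e[n], with e ~ N(0, r)
    where h[n] = [-s[n-1], ..., -s[n-order]].
    """
    s = np.asarray(s, dtype=float)
    u = np.asarray(u, dtype=float)
    theta = np.zeros(order)          # current coefficient estimate
    P = np.eye(order)                # estimate covariance
    Q = q * np.eye(order)            # random-walk process noise
    history = np.zeros((len(s), order))
    for n in range(len(s)):
        if n >= order:
            h = -s[n - order:n][::-1]            # regressor of past outputs
            P = P + Q                            # predict step
            innovation = s[n] - u[n] - h @ theta
            S = h @ P @ h + r                    # innovation variance
            K = P @ h / S                        # Kalman gain
            theta = theta + K * innovation       # update estimate
            P = P - np.outer(K, h) @ P           # update covariance
        history[n] = theta
    return history
```

As a usage example, driving `synthesize_arx` with white noise and fixed coefficients such as `[-1.2, 0.8]`, then passing the result and the same excitation to `kalman_arx_track`, should yield a coefficient track that settles near the true values. In a joint scheme like the one the abstract describes, such a filter would sit inside an outer loop that adjusts the source parameters.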
Pages: 492-501
Page count: 10
Related papers
50 in total
  • [31] Towards Robust Glottal Source Modeling
    Perez, Javier
    Bonafonte, Antonio
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 56 - 59
  • [32] Nonlinear source-filter coupling in phonation: Theory
    Titze, Ingo R.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (05): : 2733 - 2749
  • [34] A dual source-filter model of snore audio for snorer group classification
    Rao, Achuth M., V
    Yadav, Shivani
    Ghosh, Prasanta Kumar
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3502 - 3506
  • [35] Nonlinear interactive source-filter models for speech
    Koc, Turgay
    Ciloglu, Tolga
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 365 - 394
  • [36] GLOTTAL SOURCE ASYMMETRY ESTIMATION BY ICA
    Gomez-Vilda, Pedro
    Fernandez-Baillo, Roberto
    Rodellar-Biarge, Victoria
    Puntonet, Carlos G.
    [J]. BIOSIGNALS 2011, 2011, : 559 - +
  • [37] GLOTTAL SOURCE ESTIMATION USING A SUM-OF-EXPONENTIALS MODEL
    KRISHNAMURTHY, AK
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (03) : 682 - 686
  • [38] Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition
    Boonkla, Surasak
    Unoki, Masashi
    Makhanov, Stanislav S.
    Wutiwiwatchai, Chai
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (10) : 1762 - 1773
  • [39] The influence of source-filter interaction on the voice source in a three-dimensional computational model of voice production
    Zhang, Zhaoyan
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (04): : 2462 - 2475
  • [40] Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques
    Drugman, Thomas
    Dubuisson, Thomas
    Moinet, Alexis
    D'Alessandro, Nicolas
    Dutoit, Thierry
    [J]. SIGMAP 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2008, : 202 - 207