Robust glottal source estimation based on joint source-filter model optimization

Cited by: 51
Authors
Fu, Q [1 ]
Murphy, P [1 ]
Affiliation
[1] Univ Limerick, Dept Elect & Comp Engn, Limerick, Ireland
Keywords
convex optimization; glottal inverse filtering; source-filter joint optimization; source-filter separation; time-varying vocal tract filter;
DOI
10.1109/TSA.2005.857807
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper describes a robust glottal source estimation method based on a joint source-filter separation technique. In this method, the Liljencrants-Fant (LF) model, which models the glottal flow derivative, is integrated into a time-varying ARX speech production model. These two models are estimated in a joint optimization procedure, in which a Kalman filtering process is embedded for adaptively identifying the vocal tract parameters. Since the formulated joint estimation problem is a multiparameter nonlinear optimization procedure, we separate the optimization procedure into two passes. The first pass initializes the glottal source and vocal tract models by solving a quasi-convex approximate optimization problem. Having robust initial values, the joint estimation procedure determines the accuracy of model estimation implemented with a trust-region descent optimization algorithm. Experiments with synthetic and real voice signals show that the proposed method is a robust glottal source parameter estimation method with a high degree of accuracy.
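The ARX formulation mentioned in the abstract can be illustrated with a much simpler, time-invariant toy case. The sketch below is not the authors' algorithm: it assumes the excitation is already known, replaces the LF model with a hypothetical one-cycle sine pulse train, and uses ordinary least squares instead of Kalman filtering or trust-region optimization. The function names `synth_arx` and `fit_arx`, the model order, and all numeric values are illustrative assumptions.

```python
import numpy as np

def synth_arx(a, b0, u):
    """Generate s[n] = -sum_k a[k] * s[n-k-1] + b0 * u[n] (noise-free ARX)."""
    p = len(a)
    s = np.zeros(len(u))
    for n in range(len(u)):
        past = sum(a[k] * s[n - k - 1] for k in range(p) if n - k - 1 >= 0)
        s[n] = -past + b0 * u[n]
    return s

def fit_arx(s, u, p):
    """Least-squares estimate of (a, b0) given the output s and a known input u."""
    # Each regression row is [-s[n-1], ..., -s[n-p], u[n]].
    rows = [np.concatenate((-s[n - p:n][::-1], [u[n]])) for n in range(p, len(s))]
    theta, *_ = np.linalg.lstsq(np.array(rows), s[p:], rcond=None)
    return theta[:p], theta[p]

# Hypothetical excitation: one sine cycle per period, then a silent phase.
# This is only a stand-in for a glottal flow derivative, not the LF model.
N, period = 800, 80
phase = (np.arange(N) % period) / period
u = np.where(phase < 0.6, np.sin(2 * np.pi * phase / 0.6), 0.0)

# Hypothetical single-resonance "vocal tract": pole pair at radius 0.95.
r, theta_pole = 0.95, 2 * np.pi * 500 / 8000
a_true = np.array([-2 * r * np.cos(theta_pole), r ** 2])
b0_true = 1.0

s = synth_arx(a_true, b0_true, u)
a_hat, b0_hat = fit_arx(s, u, p=2)
```

In this noise-free setting the filter coefficients are recovered almost exactly. The joint problem described in the abstract is harder because the source parameters are unknown as well and the vocal tract filter varies over time.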
Pages: 492 - 501
Page count: 10
Related papers
50 in total
  • [1] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Qiang
    Murphy, Peter
    Yan, Yong-Hong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (05) : 982 - 986
  • [2] Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth S.
    [J]. SPEECH COMMUNICATION, 2011, 53 (01) : 98 - 109
  • [3] A SPECTRAL GLOTTAL FLOW MODEL FOR SOURCE-FILTER SEPARATION OF SPEECH
    Perrotin, Olivier
    McLoughlin, Ian
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 7160 - 7164
  • [4] Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution
    Schleusing, Olaf
    Kinnunen, Tomi
    Story, Brad
    Vesin, Jean-Marc
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08) : 1560 - 1572
  • [5] Estimation of the source-filter model using temporal dynamics
    Ihara, Mizuki
    Maeda, Shin-ichi
    Ishii, Shin
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007 : 3103 - 3108
  • [6] Estimation of Source-Filter Interaction Regions Based on Electroglottography
    Palaparthi, Anil
    Maxfield, Lynn
    Titze, Ingo R.
    [J]. JOURNAL OF VOICE, 2019, 33 (03) : 269 - 276
  • [7] Analysis of glottal inverse filtering in the presence of source-filter interaction
    Palaparthi, Anil
    Titze, Ingo R.
    [J]. SPEECH COMMUNICATION, 2020, 123 : 98 - 108
  • [8] Theory of glottal airflow and source-filter interaction in speaking and singing
    Titze, IR
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2004, 90 (04) : 641 - 648
  • [9] MODELING PLUCKED GUITAR TONES VIA JOINT SOURCE-FILTER ESTIMATION
    Migneco, Raymond V.
    Kim, Youngmoo E.
    [J]. 2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011 : 128 - 133
  • [10] The source-filter model and ringing pipes
    Green, T
    McKeown, JD
    [J]. BRITISH JOURNAL OF AUDIOLOGY, 1997, 31 (02) : 105 - 105