Robust glottal source estimation based on joint source-filter model optimization

Cited by: 51
Authors
Fu, Q [1 ]
Murphy, P [1 ]
Affiliations
[1] Univ Limerick, Dept Elect & Comp Engn, Limerick, Ireland
Keywords
convex optimization; glottal inverse filtering; source-filter joint optimization; source-filter separation; time-varying vocal tract filter
DOI
10.1109/TSA.2005.857807
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper describes a robust glottal source estimation method based on a joint source-filter separation technique. The Liljencrants-Fant (LF) model, which models the glottal flow derivative, is integrated into a time-varying ARX speech production model. The two models are estimated in a joint optimization procedure in which a Kalman filter is embedded to identify the vocal tract parameters adaptively. Because the resulting joint estimation problem is a multiparameter nonlinear optimization problem, the procedure is split into two passes. The first pass initializes the glottal source and vocal tract models by solving a quasi-convex approximate optimization problem. The second pass starts from these robust initial values and refines the joint estimate with a trust-region descent optimization algorithm; this pass determines the accuracy of the final model estimates. Experiments with synthetic and real voice signals show that the proposed method estimates glottal source parameters robustly and with a high degree of accuracy.
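As a rough illustration of the Kalman-filtering step mentioned in the abstract, the sketch below tracks the coefficients of an ARX model with a random-walk Kalman filter. It is not the authors' implementation: the LF source model, the quasi-convex initialization, and the trust-region refinement are all omitted. The function name `kalman_arx`, the model form in the docstring, the noise settings `q` and `r`, and the white-noise stand-in for the glottal source are assumptions made only for this example.

```python
import numpy as np

def kalman_arx(s, u, order=2, q=1e-6, r=1e-4):
    """Track ARX coefficients with a random-walk Kalman filter.

    Assumed (illustrative) model:
        s[n] = -sum_k a_k[n] * s[n-k] + b[n] * u[n] + e[n]
    The state is theta[n] = [a_1, ..., a_p, b], modelled as a random walk
    with process covariance q*I; r is the assumed variance of e[n].
    """
    n_params = order + 1
    theta = np.zeros(n_params)          # initial coefficient estimate
    P = np.eye(n_params)                # initial estimate covariance
    history = np.zeros((len(s), n_params))
    for n in range(len(s)):
        if n >= order:
            # Regressor: negated past outputs, then the current source sample.
            h = np.concatenate((-s[n - order:n][::-1], [u[n]]))
            P = P + q * np.eye(n_params)        # predict step (random walk)
            innovation = s[n] - h @ theta       # one-step prediction error
            gain = P @ h / (h @ P @ h + r)      # Kalman gain
            theta = theta + gain * innovation   # update coefficient estimate
            P = P - np.outer(gain, h) @ P       # update covariance
        history[n] = theta
    return history

# Synthetic check with fixed coefficients a = [-1.5, 0.8], b = 1.0.
# White noise stands in for the source here; the paper uses an LF model.
rng = np.random.default_rng(0)
N = 2000
u = rng.standard_normal(N)
s = np.zeros(N)
for n in range(2, N):
    s[n] = 1.5 * s[n - 1] - 0.8 * s[n - 2] + u[n] + 0.01 * rng.standard_normal()

history = kalman_arx(s, u, order=2)
print(history[-1])  # final estimate of [a_1, a_2, b]
```

In this setup the source `u` is treated as known. In a joint scheme of the kind the abstract describes, the source parameters would instead be adjusted by an outer optimization loop around a filter like this one.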
Pages: 492-501
Number of pages: 10
Related papers
50 in total
  • [21] Modeling source-source and source-filter acoustic interaction in birdsong
    Laje, R
    Mindlin, GB
    [J]. PHYSICAL REVIEW E, 2005, 72 (03):
  • [22] Modeling and joint estimation of glottal source and vocal tract filter by state-space methods
    Alzamendi, Gabriel A.
    Schlotthauer, Gaston
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2017, 37 : 5 - 15
  • [23] Automatic Classification of Healthy Subjects and Patients With Essential Vocal Tremor Using Probabilistic Source-Filter Model Based Noise Robust Pitch Estimation
    Rao, M. V. Achuth
    Yamini, B. K.
    Ketan, J.
    Shetty, A. Preetie
    Pal, Pramod Kumar
    Shivashankar, N.
    Ghosh, Prasanta Kumar
    [J]. JOURNAL OF VOICE, 2023, 37 (03) : 314 - 321
  • [24] A Modified Additive Synthesis Method Using Source-Filter Model
    Korvel, Grazina
    Simonyte, Virginija
    Slivinskas, Vytautas
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2015, 63 (06) : 443 - 450
  • [25] A modified additive synthesis method using source-filter model
    Korvel, Gražina
    Šimonyte, Virginija
    Slivinskas, Vytautas
    [J]. AES: Journal of the Audio Engineering Society, 2015, 63 (06) : 443 - 450
  • [26] Source-Filter Modeling in the Sinusoid Domain
    Wen, Xue
    Sandler, Mark
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2010, 58 (10) : 795 - 808
  • [27] PSFM-A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection
    Rao, M. V. Achuth
    Ghosh, Prasanta Kumar
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1645 - 1657
  • [28] FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
    Bak, Taejun
    Bae, Jae-Sung
    Bae, Hanbin
    Kim, Young-Ik
    Cho, Hoon-Young
    [J]. INTERSPEECH 2021, 2021, : 116 - 120
  • [29] A Source-Filter based Adaptive Harmonic Model and Its Application to Speech Prosody Modification
    Lee, JeeSok
    Soong, Frank K.
    Kang, Hong-Goo
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 39 - 43
  • [30] TOWARDS SOURCE-FILTER BASED SINGLE SENSOR SPEECH SEPARATION
    Stark, Michael
    Pernkopf, Franz
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 97 - 100