Robust glottal source estimation based on joint source-filter model optimization

被引：51

作者：

Fu, Q ^{[1
]}

Murphy, P ^{[1
]}

机构：

[1] Univ Limerick, Dept Elect & Comp Engn, Limerick, Ireland

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期

关键词：

convex optimization; glottal inverse filtering; source-filter joint optimization; source-filter separation; time-varying vocal tract filter;

D O I：

10.1109/TSA.2005.857807

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper describes a robust glottal source estimation method based on a joint source-filter separation technique. In this method, the Liljencrants-Fant (LF) model, which models the glottal flow derivative, is integrated into a time-varying ARX speech production model. These two models are estimated in a joint optimization procedure, in which a Kalman filtering process is embedded for adaptively identifying the vocal tract parameters. Since the formulated joint estimation problem is a multiparameter nonlinear optimization procedure, we separate the optimization procedure into two passes. The first pass initializes the glottal source and vocal tract models by solving a quasi-convex approximate optimization problem. Having robust initial values, the joint estimation procedure determines the accuracy of model estimation implemented with a trust-region descent optimization algorithm. Experiments with synthetic and real voice signals show that the proposed method is a robust glottal source parameter estimation method with a high degree of accuracy.

引用

页码：492 / 501

页数：10

共 50 条

[1] Robust glottal source estimation based on joint source-filter model optimization
Fu, Qiang
Murphy, Peter
Yan, Yong-Hong
[J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (05): : 982 - 986
[2] Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter
Ghosh, Prasanta Kumar
Narayanan, Shrikanth S.
[J]. SPEECH COMMUNICATION, 2011, 53 (01) : 98 - 109
[3] A SPECTRAL GLOTTAL FLOW MODEL FOR SOURCE-FILTER SEPARATION OF SPEECH
Perrotin, Olivier
McLoughlin, Ian
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7160 - 7164
[4] Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution
Schleusing, Olaf
Kinnunen, Tomi
Story, Brad
Vesin, Jean-Marc
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1560 - 1572
[5] Estimation of the source-filter model using temporal dynamics
Ihara, Mizuki
Maeda, Shin-ichi
Ishii, Shin
[J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 3103 - 3108
[6] Estimation of Source-Filter Interaction Regions Based on Electroglottography
Palaparthi, Anil
Maxfield, Lynn
Titze, Ingo R.
[J]. JOURNAL OF VOICE, 2019, 33 (03) : 269 - 276
[7] Analysis of glottal inverse filtering in the presence of source-filter interaction
Palaparthi, Anil
Titze, Ingo R.
[J]. SPEECH COMMUNICATION, 2020, 123 : 98 - 108
[8] Theory of glottal airflow and source-filter interaction in speaking and singing
Titze, IR
[J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2004, 90 (04) : 641 - 648
[9] MODELING PLUCKED GUITAR TONES VIA JOINT SOURCE-FILTER ESTIMATION
Migneco, Raymond V.
Kim, Youngmoo E.
[J]. 2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011, : 128 - 133
[10] The source-filter model and ringing pipes
Green, T
McKeown, JD
[J]. BRITISH JOURNAL OF AUDIOLOGY, 1997, 31 (02): : 105 - 105

← 1 2 3 4 5 →