Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter

被引：10

作者：

Ghosh, Prasanta Kumar ^{[1
]}

Narayanan, Shrikanth S. ^{[1
]}

机构：

[1] Univ So Calif, Dept Elect Engn, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA

来源：

SPEECH COMMUNICATION | 2011年 / 53卷 / 01期

基金：

美国国家科学基金会;

关键词：

Glottal flow derivative; Shimmer; Jitter; Glottal source estimation; SPEECH; FLOW; MODEL; QUALITY;

D O I：

10.1016/j.specom.2010.07.004

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We propose a glottal source estimation method robust to shimmer and jitter in the glottal flow. The proposed estimation method is based on a joint source-filter optimization technique. The glottal source is modeled by the Liljencrants-Fant (LF) model and the vocal-tract filter is modeled by an auto-regressive filter, which is common in the source-filter approach to speech production. The optimization estimates the parameters of the LF model, the amplitudes of the glottal flow in each pitch period, and the vocal-tract filter coefficients so that the speech production model best describes the observed speech samples. Experiments with synthetic and real speech data show that the proposed estimation method is robust to different phonation types with varying shimmer and jitter characteristics. (C) 2010 Elsevier B.V. All rights reserved.

引用

页码：98 / 109

页数：12

共 50 条

[21] Nonlinear source-filter coupling in phonation: Theory
Titze, Ingo R.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (05): : 2733 - 2749
[22] Nonlinear interactive source-filter models for speech
Koc, Turgay
Ciloglu, Tolga
[J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 365 - 394
[23] An Experimentally Measured Source-Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model
Wolfe, Joe
Chu, Derek Tze Wei
Chen, Jer-Ming
Smith, John
[J]. ACOUSTICS AUSTRALIA, 2016, 44 (01): : 187 - 191
[24] GLOTTAL SOURCE ASYMMETRY ESTIMATION BY ICA
Gomez-Vilda, Pedro
Fernandez-Baillo, Roberto
Rodellar-Biarge, Victoria
Puntonet, Carlos G.
[J]. BIOSIGNALS 2011, 2011, : 559 - +
[25] Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques
Drugman, Thomas
Dubuisson, Thomas
Moinet, Alexis
D'Alessandro, Nicolas
Dutoit, Thierry
[J]. SIGMAP 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2008, : 202 - 207
[26] A Novel Source-Filter Stochastic Model for Voice Production
Cataldo, E.
Monteiro, L.
Soize, C.
[J]. JOURNAL OF VOICE, 2023, 37 (01) : 1 - 8
[27] Quantifying Parameters of a Source-Filter Model for Oesophageal Speech
Toole, John M. O'
Garcia Zapirain, Begona
[J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2011, : 532 - 537
[28] A SOURCE-FILTER MODEL FOR MUSICAL INSTRUMENT SOUND TRANSFORMATION
Caetano, Marcelo
Rodet, Xavier
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 137 - 140
[29] Nonlinear source-filter coupling in phonation: Vocal exercises
Titze, Ingo
Riede, Tobias
Popolo, Peter
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 1902 - 1915
[30] Source-filter Separation of Speech Signal in the Phase Domain
Loweimi, Erfan
Barker, Jon
Hain, Thomas
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 598 - 602

← 1 2 3 4 5 →