Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter

被引:10
|
作者
Ghosh, Prasanta Kumar [1 ]
Narayanan, Shrikanth S. [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
Glottal flow derivative; Shimmer; Jitter; Glottal source estimation; SPEECH; FLOW; MODEL; QUALITY;
D O I
10.1016/j.specom.2010.07.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a glottal source estimation method robust to shimmer and jitter in the glottal flow. The proposed estimation method is based on a joint source-filter optimization technique. The glottal source is modeled by the Liljencrants-Fant (LF) model and the vocal-tract filter is modeled by an auto-regressive filter, which is common in the source-filter approach to speech production. The optimization estimates the parameters of the LF model, the amplitudes of the glottal flow in each pitch period, and the vocal-tract filter coefficients so that the speech production model best describes the observed speech samples. Experiments with synthetic and real speech data show that the proposed estimation method is robust to different phonation types with varying shimmer and jitter characteristics. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:98 / 109
页数:12
相关论文
共 50 条
  • [21] Nonlinear source-filter coupling in phonation: Theory
    Titze, Ingo R.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (05): : 2733 - 2749
  • [22] Nonlinear interactive source-filter models for speech
    Koc, Turgay
    Ciloglu, Tolga
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 365 - 394
  • [23] An Experimentally Measured Source-Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model
    Wolfe, Joe
    Chu, Derek Tze Wei
    Chen, Jer-Ming
    Smith, John
    [J]. ACOUSTICS AUSTRALIA, 2016, 44 (01): : 187 - 191
  • [24] GLOTTAL SOURCE ASYMMETRY ESTIMATION BY ICA
    Gomez-Vilda, Pedro
    Fernandez-Baillo, Roberto
    Rodellar-Biarge, Victoria
    Puntonet, Carlos G.
    [J]. BIOSIGNALS 2011, 2011, : 559 - +
  • [25] Glottal source estimation robustness - A comparison of sensitivity of voice source estimation techniques
    Drugman, Thomas
    Dubuisson, Thomas
    Moinet, Alexis
    D'Alessandro, Nicolas
    Dutoit, Thierry
    [J]. SIGMAP 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2008, : 202 - 207
  • [26] A Novel Source-Filter Stochastic Model for Voice Production
    Cataldo, E.
    Monteiro, L.
    Soize, C.
    [J]. JOURNAL OF VOICE, 2023, 37 (01) : 1 - 8
  • [27] Quantifying Parameters of a Source-Filter Model for Oesophageal Speech
    Toole, John M. O'
    Garcia Zapirain, Begona
    [J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2011, : 532 - 537
  • [28] A SOURCE-FILTER MODEL FOR MUSICAL INSTRUMENT SOUND TRANSFORMATION
    Caetano, Marcelo
    Rodet, Xavier
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 137 - 140
  • [29] Nonlinear source-filter coupling in phonation: Vocal exercises
    Titze, Ingo
    Riede, Tobias
    Popolo, Peter
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 1902 - 1915
  • [30] Source-filter Separation of Speech Signal in the Phase Domain
    Loweimi, Erfan
    Barker, Jon
    Hain, Thomas
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 598 - 602