Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter

Cited by: 10
Authors
Ghosh, Prasanta Kumar [1]
Narayanan, Shrikanth S. [1]
Affiliation
[1] University of Southern California, Department of Electrical Engineering, Signal Analysis and Interpretation Laboratory, Los Angeles, CA 90089, USA
Funding
National Science Foundation (USA)
Keywords
Glottal flow derivative; Shimmer; Jitter; Glottal source estimation; SPEECH; FLOW; MODEL; QUALITY;
DOI
10.1016/j.specom.2010.07.004
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose a glottal source estimation method robust to shimmer and jitter in the glottal flow. The proposed estimation method is based on a joint source-filter optimization technique. The glottal source is modeled by the Liljencrants-Fant (LF) model and the vocal-tract filter is modeled by an auto-regressive filter, which is common in the source-filter approach to speech production. The optimization estimates the parameters of the LF model, the amplitudes of the glottal flow in each pitch period, and the vocal-tract filter coefficients so that the speech production model best describes the observed speech samples. Experiments with synthetic and real speech data show that the proposed estimation method is robust to different phonation types with varying shimmer and jitter characteristics. (C) 2010 Elsevier B.V. All rights reserved.
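The structure described in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a toy pulse shape in place of the LF model, known pitch-period boundaries, noise-free data, and an equation-error least-squares criterion with a grid search over a single shape parameter. All function names and parameters (`pulse`, `open_frac`, `fit_linear`, `joint_fit`) are hypothetical. The point it shows: once the pulse shape is fixed, the auto-regressive coefficients and the per-period amplitudes enter the model linearly, so they can be solved jointly, with unequal period lengths standing in for jitter and unequal amplitudes for shimmer.

```python
import numpy as np

def pulse(length, open_frac):
    """Toy glottal-flow-derivative pulse (an assumption, NOT the LF model)."""
    n_open = int(round(open_frac * length))
    g = np.zeros(length)
    t = np.arange(n_open) / n_open
    g[:n_open] = t * np.sin(2 * np.pi * t)  # small positive lobe, larger negative lobe
    return g

def synthesize(ar, amps, lengths, open_frac):
    """Drive an all-pole filter with amplitude-scaled pulses of unequal length."""
    e = np.concatenate([A * pulse(L, open_frac) for A, L in zip(amps, lengths)])
    s = np.zeros_like(e)
    for n in range(len(e)):
        s[n] = e[n] + sum(a * s[n - k] for k, a in enumerate(ar, start=1) if n - k >= 0)
    return s

def fit_linear(s, lengths, order, open_frac):
    """For a fixed pulse shape, solve for AR coefficients and amplitudes by least squares."""
    N = len(s)
    starts = [0] + [int(c) for c in np.cumsum(lengths)[:-1]]
    # Columns 0..order-1: speech delayed by 1..order samples (zero initial state).
    lagged = np.column_stack(
        [np.concatenate([np.zeros(k), s[:N - k]]) for k in range(1, order + 1)]
    )
    # One column per pitch period holding that period's unit-amplitude pulse.
    pulses = np.zeros((N, len(lengths)))
    for p, (n0, L) in enumerate(zip(starts, lengths)):
        pulses[n0:n0 + L, p] = pulse(L, open_frac)
    X = np.hstack([lagged, pulses])
    theta, *_ = np.linalg.lstsq(X, s, rcond=None)
    err = float(np.sum((s - X @ theta) ** 2))
    return theta[:order], theta[order:], err

def joint_fit(s, lengths, order, grid):
    """Grid-search the shape parameter; keep the candidate with the smallest residual."""
    results = [(f,) + fit_linear(s, lengths, order, f) for f in grid]
    return min(results, key=lambda r: r[-1])

# Synthetic example: period lengths vary (jitter), amplitudes vary (shimmer).
true_ar = [1.0, -0.8]
true_amps = [1.0, 0.8, 1.1, 0.9, 1.05]
lengths = [80, 84, 78, 82, 79]
s = synthesize(true_ar, true_amps, lengths, open_frac=0.6)

best_frac, est_ar, est_amps, err = joint_fit(s, lengths, 2, np.linspace(0.4, 0.8, 9))
print(best_frac, est_ar, est_amps)
```

In this noise-free setting the search should recover the generating parameters. With real speech, the pulse model, the period boundaries, and the optimizer would all need to be chosen with more care than this sketch takes.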
Pages: 98-109
Number of pages: 12
Related papers
50 in total
  • [1] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Qiang
    Murphy, Peter
    Yan, Yong-Hong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (05) : 982 - 986
  • [2] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Q
    Murphy, P
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02) : 492 - 501
  • [3] Analysis of glottal inverse filtering in the presence of source-filter interaction
    Palaparthi, Anil
    Titze, Ingo R.
    [J]. SPEECH COMMUNICATION, 2020, 123 : 98 - 108
  • [4] Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution
    Schleusing, Olaf
    Kinnunen, Tomi
    Story, Brad
    Vesin, Jean-Marc
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08) : 1560 - 1572
  • [5] A SPECTRAL GLOTTAL FLOW MODEL FOR SOURCE-FILTER SEPARATION OF SPEECH
    Perrotin, Olivier
    McLoughlin, Ian
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 7160 - 7164
  • [6] Theory of glottal airflow and source-filter interaction in speaking and singing
    Titze, IR
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2004, 90 (04) : 641 - 648
  • [7] MODELING PLUCKED GUITAR TONES VIA JOINT SOURCE-FILTER ESTIMATION
    Migneco, Raymond V.
    Kim, Youngmoo E.
    [J]. 2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011 : 128 - 133
  • [8] Estimation of Source-Filter Interaction Regions Based on Electroglottography
    Palaparthi, Anil
    Maxfield, Lynn
    Titze, Ingo R.
    [J]. JOURNAL OF VOICE, 2019, 33 (03) : 269 - 276
  • [9] Estimation of the source-filter model using temporal dynamics
    Ihara, Mizuki
    Maeda, Shin-ichi
    Ishii, Shin
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007 : 3103 - 3108
  • [10] Robust Source-Filter Separation of Speech Signal in the Phase Domain
    Loweimi, Erfan
    Barker, Jon
    Torralba, Oscar Saz
    Hain, Thomas
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017 : 414 - 418