Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation

Cited by: 43
Authors
Drugman, Thomas [1]
Bozkurt, Baris [2]
Dutoit, Thierry [1]
Affiliations
[1] Univ Mons, TCTS Lab, B-7000 Mons, Belgium
[2] Izmir Inst Technol, Dept Elect & Elect Engn, Izmir, Turkey
Keywords
Complex cepstrum; Homomorphic analysis; Glottal source estimation; Source-tract separation;
DOI
10.1016/j.specom.2011.02.004
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Complex cepstrum is known in the literature for linearly separating causal and anticausal components. Relying on advances achieved by the Zeros of the Z-Transform (ZZT) technique, we here investigate the possibility of using complex cepstrum for glottal flow estimation on a large-scale database. Via a systematic study of the windowing effects on the deconvolution quality, we show that the complex cepstrum causal-anticausal decomposition can be effectively used for glottal flow estimation when specific windowing criteria are met. It is also shown that this complex cepstral decomposition gives glottal estimates similar to those obtained with the ZZT method. However, as complex cepstrum uses FFT operations instead of requiring the factoring of high-degree polynomials, the method benefits from a much higher speed. Finally, in our tests on a large corpus of real expressive speech, we show that the proposed method has the potential to be used for voice quality analysis. (C) 2011 Elsevier B.V. All rights reserved.
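The FFT-based decomposition mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `complex_cepstrum` and `causal_anticausal_split`, the FFT size, the log floor, and the even split of the zero-quefrency term are all assumptions made for illustration. The sketch also assumes the input frame has already been windowed appropriately and has a positive DC value; the windowing criteria themselves are not given in the abstract and are not reproduced here.

```python
import numpy as np


def complex_cepstrum(frame, nfft=4096):
    """Complex cepstrum of a real frame, computed with FFTs only (sketch)."""
    spectrum = np.fft.fft(frame, nfft)
    log_mag = np.log(np.abs(spectrum) + 1e-12)  # small floor avoids log(0)
    phase = np.unwrap(np.angle(spectrum))
    # Remove the linear-phase term (a pure delay) so the phase is periodic.
    half = nfft // 2
    delay = int(round(phase[half] / np.pi))
    phase = phase - np.pi * delay * np.arange(nfft) / half
    return np.fft.ifft(log_mag + 1j * phase).real


def causal_anticausal_split(ceps):
    """Split a complex cepstrum into causal and anticausal time-domain parts."""
    nfft = len(ceps)
    half = nfft // 2
    causal_ceps = np.zeros(nfft)
    anticausal_ceps = np.zeros(nfft)
    causal_ceps[1:half] = ceps[1:half]            # positive quefrencies
    anticausal_ceps[half + 1:] = ceps[half + 1:]  # negative quefrencies
    # Assumption: the zero-quefrency (gain) term is shared evenly.
    causal_ceps[0] = anticausal_ceps[0] = 0.5 * ceps[0]
    # Invert each part: cepstrum -> log spectrum -> spectrum -> time domain.
    causal = np.fft.ifft(np.exp(np.fft.fft(causal_ceps))).real
    anticausal = np.fft.ifft(np.exp(np.fft.fft(anticausal_ceps))).real
    return causal, anticausal


# Toy signal (not speech): a minimum-phase factor [1, 0.5] convolved with a
# maximum-phase factor [0.4, 1], i.e. one zero inside and one outside the
# unit circle.
x = np.convolve([1.0, 0.5], [0.4, 1.0])
ceps = complex_cepstrum(x, nfft=1024)
causal, anticausal = causal_anticausal_split(ceps)
```

In this toy case the causal part recovers the minimum-phase factor at indices 0 and 1, while the anticausal part recovers the maximum-phase factor at index 0 and, by circular indexing, at the last index. In the glottal setting the anticausal part would be taken as the estimate of the glottal open phase, but only under the windowing conditions studied in the paper.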
Pages: 855-866
Number of pages: 12
Related papers
50 in total
  • [1] Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation
    Drugman, Thomas
    Bozkurt, Baris
    Dutoit, Thierry
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 108 - 111
  • [2] Glottal Source Estimation Using an Automatic Chirp Decomposition
    Drugman, Thomas
    Bozkurt, Baris
    Dutoit, Thierry
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 35 - +
  • [3] Chirp Complex Cepstrum-based Decomposition for Asynchronous Glottal Analysis
    Drugman, Thomas
    Dutoit, Thierry
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 657 - 660
  • [4] Speech Modeling Using the Complex Cepstrum
    Vondra, Martin
    Vich, Robert
    [J]. TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 324 - 330
  • [5] Reconstruction Of Speech Signal Using Empirical Mode Decomposition Based Glottal Source Extraction
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    [J]. 2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 27 - 32
  • [6] Estimation of the glottal source from coded telephone speech using deep neural networks
    Narendra, N. P.
    Airaksinen, Manu
    Story, Brad
    Alku, Paavo
    [J]. SPEECH COMMUNICATION, 2019, 106 : 95 - 104
  • [7] Glottal source estimation from coded telephone speech using a deep neural network
    Narendra, N. P.
    Airaksinen, Manu
    Alku, Paavo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3931 - 3935
  • [8] Glottal Source Estimation Based on Bivariate Empirical Mode Decomposition
    Kemiha, Mina
    Kacha, Abdellah
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 311 - +
  • [9] Stochastic glottal source applied to voiced-speech decomposition using state-space methods
    Alzamendi, Gabriel A.
    Schlotthauer, Gaston
    Torres, Maria E.
    [J]. 2015 XVI WORKSHOP ON INFORMATION PROCESSING AND CONTROL (RPIC), 2015,
  • [10] ITERATIVE ESTIMATION OF PHASE USING COMPLEX CEPSTRUM REPRESENTATION
    Maia, Ranniery
    Stylianou, Yannis
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4990 - 4994