Estimation of glottal closure instants by considering speech signal as a spectrum

被引:5
|
作者
Sripriya, N. [1 ]
Nagarajan, T. [1 ]
机构
[1] SSN Coll Engn, Madras, Tamil Nadu, India
关键词
D O I
10.1049/el.2014.4444
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Close to glottal closure instants (GCIs), the speech signal is expected to change its amplitude rapidly and, at GCIs, it is expected to have strong negative peaks. A novel algorithm that exploits these two properties for the estimation of GCIs is presented. Here, a symmetrised speech segment is assumed to be a Fourier transform (FT) of an even function. In such a case, at the locations of the GCIs, the strong negative peaks in the symmetrised speech segment correspond to zeros that lie considerably outside the unit circle in the z-plane. The group delay spectrum of the time-domain signal derived by taking inverse FT of this assumed FT is expected to take a value close to -2 pi at the angular locations of these zeros. Mapping frequency scale to time scale, the frequency bins for which group delay reaches -2 pi correspond to the locations of GCIs. Theoretical justification for the proposed approach is also presented by defining a novel function called the conditional group delay function. Systematic evaluation is carried out on the CMU Arctic database and the performance of the proposed technique is better than that of the algorithms namely DYPSA, ZFF, YAGA and is close to that of SEDREAMS.
引用
收藏
页码:649 / 651
页数:2
相关论文
共 50 条
  • [1] The DYPSA algorithm for estimation of glottal closure instants in voiced speech
    Kounoudes, A
    Naylor, PA
    Brookes, M
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 349 - 352
  • [2] Estimation of Glottal Closure Instants from Telephone Speech using a Group Delay-Based Approach that Considers Speech Signal as a Spectrum
    Rachel, G. Anushiya
    Vijayalakshmi, P.
    Nagarajan, T.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1181 - 1185
  • [3] Significance of Differenced EGG Signal as a Spectrum in Phase Difference Computation for the Estimation of Glottal Closure Instants
    G. Anushiya Rachel
    N. Sripriya
    P. Vijayalakshmi
    T. Nagarajan
    Circuits, Systems, and Signal Processing, 2018, 37 : 2074 - 2097
  • [4] Significance of Differenced EGG Signal as a Spectrum in Phase Difference Computation for the Estimation of Glottal Closure Instants
    Rachel, G. Anushiya
    Sripriya, N.
    Vijayalakshmi, P.
    Nagarajan, T.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (05) : 2074 - 2097
  • [5] Estimation of glottal closure instants in voiced speech using the DYPSA algorithm
    Naylor, Patrick A.
    Kounoudes, Anastasis
    Gudnason, Jon
    Brookes, Mike
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 34 - 43
  • [6] Accurate Estimation of Glottal Closure Instants and Glottal Opening Instants from Electroglottographic Signal Using Variational Mode Decomposition
    G. Jyothish Lal
    E. A. Gopalakrishnan
    D. Govind
    Circuits, Systems, and Signal Processing, 2018, 37 : 810 - 830
  • [7] Accurate Estimation of Glottal Closure Instants and Glottal Opening Instants from Electroglottographic Signal Using Variational Mode Decomposition
    Lal, G. Jyothish
    Gopalakrishnan, E. A.
    Govind, D.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (02) : 810 - 830
  • [8] USING EXTREME GRADIENT BOOSTING TO DETECT GLOTTAL CLOSURE INSTANTS IN SPEECH SIGNAL
    Matousek, Jindrich
    Tihelka, Daniel
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6515 - 6519
  • [9] Comparison of glottal closure instants obtained by using wavelet transform of speech signal and EGG signal
    Seok, JW
    Bae, KS
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1999, E82D (11) : 1486 - 1488
  • [10] COMPARISON OF GLOTTAL CLOSURE INSTANTS DETECTION ALGORITHMS FOR EMOTIONAL SPEECH
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    Yegnanarayana, B.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7379 - 7383