Formant estimation of whispered speech based on spectral segmentation

Cited by: 0
Authors
Gong Chenghui [1]
Zhao Heming [1]
Lu Gang [1]
Liu Hanxin [1]
Affiliations
[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou 215021, Peoples R China
Keywords
formant estimation; spectral segmentation; linear prediction; whispered speech
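DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract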
Whispered speech is produced without vocal cord vibration and typically has a low SNR, which makes it harder to analyze and recognize than normal speech. Formant estimation is therefore an important problem in whispered-speech research. The proposed algorithm is based on spectral segmentation: the complete spectrum is divided into K segments, each of which contains a single formant. Improved dynamic programming and Selective LP (linear prediction) are used. The former provides the segment boundaries, and the latter yields the formant frequency and bandwidth of each segment. For whispered speech, the gain of the vocal tract transfer function is also important. Tests were carried out on Chinese whispered vowels, and the proposed algorithm proved effective. At low SNR, the segment-based LP method is clearly superior to conventional LPC and LSP.
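The abstract states that linear prediction within each segment yields a formant frequency and bandwidth, but the record does not show how. As a minimal sketch, not the authors' implementation, the following uses the standard textbook relation between a second-order LP pole and a resonance: for a pole of radius r and angle θ at sampling rate fs, the frequency is θ·fs/(2π) and the bandwidth is −ln(r)·fs/π. The function names, the 8 kHz sampling rate, and the 1200 Hz / 150 Hz test resonance are assumptions chosen for illustration; the segmentation step and the Selective LP fit themselves are not reproduced here.

```python
import numpy as np

def resonator_coeffs(freq_hz, bw_hz, fs):
    """Build A(z) = 1 + a1 z^-1 + a2 z^-2 for one resonance (hypothetical helper)."""
    r = np.exp(-np.pi * bw_hz / fs)        # pole radius from bandwidth
    theta = 2.0 * np.pi * freq_hz / fs     # pole angle from centre frequency
    return np.array([1.0, -2.0 * r * np.cos(theta), r * r])

def pole_to_formant(lp_coeffs, fs):
    """Return (frequency_hz, bandwidth_hz) from a second-order LP polynomial."""
    roots = np.roots(lp_coeffs)
    pole = roots[np.argmax(roots.imag)]    # keep the pole in the upper half-plane
    freq_hz = np.angle(pole) * fs / (2.0 * np.pi)
    bw_hz = -np.log(np.abs(pole)) * fs / np.pi
    return freq_hz, bw_hz

# Assumed illustrative values, not taken from the paper.
fs = 8000.0
coeffs = resonator_coeffs(1200.0, 150.0, fs)
freq, bw = pole_to_formant(coeffs, fs)
```

In a segment-based scheme such as the one the abstract describes, a conversion of this kind would presumably be applied once per segment, with each segment contributing one pole pair; how the band limits enter the conversion is not specified in this record.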
Pages: 562 / +
Page count: 2
Related papers
50 in total
  • [21] A study of pitch, formant, and spectral estimation errors introduced by three lossy speech compression algorithms
    van Son, RJJH
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2005, 91 (04) : 771 - 778
  • [22] The research of endpoint detection and initial/final segmentation for Chinese whispered speech
    Chen, Xueqin
    Zhao, Heming
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 752 - +
  • [23] HTK-Based Recognition of Whispered Speech
    Galic, Jovan
    Jovicic, Slobodan T.
    Grozdic, Dorde
    Markovic, Branko
    SPEECH AND COMPUTER, 2014, 8773 : 251 - 258
  • [24] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
    Mathur, Arpit
    Reddy, Shankar M.
    Hegde, Rajesh M.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [25] Fusion of bottleneck, spectral and modulation spectral features for improved speaker verification of neutral and whispered speech
    Sarria-Paja, Milton
    Falk, Tiago H.
    SPEECH COMMUNICATION, 2018, 102 : 78 - 86
  • [26] Relationship between fundamental and formant frequency in whispered Mandarin
    Chen, Xueqin
    Zhao, Heming
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 303 - 306
  • [27] Cepstral method evaluation in speech formant frequencies estimation
    Kammoun, MA
    Gargouri, D
    Frikha, M
    Ben Hamida, A
    2004 IEEE International Conference on Industrial Technology (ICIT), Vols. 1-3, 2004, : 1612 - 1616
  • [28] Correlation based speech formant recovery
    Nelson, D
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1643 - 1646
  • [29] Formant comparison between whispered and voiced vowels in Mandarin
    Li, XL
    Xu, BL
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2005, 91 (06) : 1079 - 1085
  • [30] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
    Arpit Mathur
    Shankar M Reddy
    Rajesh M Hegde
    EURASIP Journal on Advances in Signal Processing, 2012