Formant estimation of whispered speech based on spectral segmentation

被引：0

作者：

Gong Chenghui ^{[1
]}

Zhao Heming ^{[1
]}

Lu Gang ^{[1
]}

Liu Hanxin ^{[1
]}

机构：

[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou 215021, Peoples R China

来源：

2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2 | 2006年

关键词：

formant estimation; spectral segmentation; Linear Prediction; whispered speech;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Whispered speech, without vocal cord vibration and always in low SNR, is more difficult both in its analysis and recognition. Thus its formant estimation becomes prominent in each field. The proposed algorithm is based on spectral segmentation. The complete spectrum is segmented into K segments, each of which contains a single formant. Here, improved dynamic programming and Selective LP (Linear Predictive) methods are used The former offers segment boundaries, and the latter leads to the parameters of formant frequency and its bandwidth as well. For whispered speech, the gain of vocal tract transfer function is also important. The tests are carried on Chinese whispered vowels, and the proposed algorithm is proved to be efficient. In low SAT, the segment based LP method is obviously superior to the conventional LPC and LSP.

引用

页码：562 / +

页数：2

共 50 条

[21] A study of pitch, formant, and spectral estimation errors introduced by three lossy speech compression algorithms
van Son, RJJH
ACTA ACUSTICA UNITED WITH ACUSTICA, 2005, 91 (04) : 771 - 778
[22] The research of endpoint detection and initial/final segmentation for Chinese whispered speech
Chen, Xueqin
Zhao, Heming
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 752 - +
[23] HTK-Based Recognition of Whispered Speech
Galic, Jovan
Jovicic, Slobodan T.
Grozdic, Dorde
Markovic, Branko
SPEECH AND COMPUTER, 2014, 8773 : 251 - 258
[24] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
Mathur, Arpit
Reddy, Shankar M.
Hegde, Rajesh M.
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
[25] Fusion of bottleneck, spectral and modulation spectral features for improved speaker verification of neutral and whispered speech
Sarria-Paja, Milton
Falk, Tiago H.
SPEECH COMMUNICATION, 2018, 102 : 78 - 86
[26] Relationship between fundamental and formant frequency in whispered Mandarin
Chen, Xueqin
Zhao, Heming
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 303 - 306
[27] Cepstral method evaluation in speech formant frequencies estimation
Kammoun, MA
Gargouri, D
Frikha, M
Ben Hamida, A
2004 IEEE International Conference on Industrial Technology (ICIT), Vols. 1- 3, 2004, : 1612 - 1616
[28] Correlation based speech formant recovery
Nelson, D
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1643 - 1646
[29] Formant comparison between whispered and voiced vowels in Mandarin
Li, XL
Xu, BL
ACTA ACUSTICA UNITED WITH ACUSTICA, 2005, 91 (06) : 1079 - 1085
[30] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
Arpit Mathur
Shankar M Reddy
Rajesh M Hegde
EURASIP Journal on Advances in Signal Processing, 2012

← 1 2 3 4 5 →