An Approach for Formant Based Speech Recognition in Noise

被引:0
|
作者
Fattah, Shaikh Anowarul [1 ]
Ghosh, Tonmoy [1 ]
Das, Apurba Kumar [1 ]
Goswami, Rajib [1 ]
Shafin, Abu [1 ]
Jameel, Mohammad Mahdee [1 ]
Shahnaz, Celia [1 ]
机构
[1] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
关键词
Formant estimation; noise; higher order Yule-Walker equations; speech analysis; vowel recognition; TRACKING;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a noise robust formant frequency estimation scheme is developed utilizing the advantageous properties of the autocorrelation function of the band-limited noisy speech signal. It is shown that the use of autocorrelation operation on a speech signal, which is band-limited to a particular formant zone, in comparison to one without any band limitation, can provide higher noise immunity, especially under severe noisy condition. In order to extract each formant, a modified higher order Yule-Walker method is employed on the resulting autocorrelation sequence. Within a band, the pole with the maximum energy is selected as the formant. The estimated formants are used as features along with conventional Mel frequency cepstral coefficients in a vowel recognition system, where the linear discriminant based classifier is utilized. Extensive experimentation is carried out on speech samples taken from the TIMIT standard speech database. It is found that the proposed algorithm provides superior formant estimation accuracy in comparison to that obtained by some of the state of the art methods even at a very low level of signal-to-noise ratio (SNR) for both male and female speakers. Moreover, formant estimates obtained by the proposed method can also provide better vowel recognition accuracy in the presence of significant background noise.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Optimization of Formant Feature Based Speech Recognition
    Lipeika, Antanas
    [J]. INFORMATICA, 2010, 21 (03) : 361 - 374
  • [2] Formant estimation for speech recognition
    Welling, L
    Ney, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 36 - 48
  • [3] The formant structure based feature parameter for speech recognition
    Zhao, JH
    Kuang, JM
    Xie, X
    [J]. PROCEEDINGS OF THE 2003 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING, 2003, : 605 - 608
  • [4] Algorithms for Vowel Recognition in Fluent Speech Based on Formant Positions
    Stanek, Miroslav
    Polak, Ladislav
    [J]. 2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 521 - 525
  • [5] Speech Recognition System and Formant Based Analysis of Spoken Arabic Vowels
    Alotaibi, Yousef Ajami
    Hussain, Amir
    [J]. FUTURE GENERATION INFORMATION TECHNOLOGY, PROCEEDINGS, 2009, 5899 : 50 - +
  • [6] Formant weighted cepstral feature for LSP-based speech recognition
    Hur, HY
    Kim, HS
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 141 - 144
  • [7] FORMANT NORMALIZATION FOR SPEECH RECOGNITION AND VOWEL STUDIES
    HIERONYMUS, JL
    [J]. SPEECH COMMUNICATION, 1991, 10 (5-6) : 471 - 478
  • [8] FORMANT BASED SPEECH SYNTHESIS
    HUGHES, PM
    [J]. BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 84 - 90
  • [9] Speech Perception in Noise With Formant Enhancement for Older Listeners
    Guan, Jingjing
    Liu, Chang
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (09): : 3290 - 3301
  • [10] A FORMANT TRACKING SYSTEM TOWARD AUTOMATIC RECOGNITION OF SPEECH
    LAFACE, P
    [J]. SIGNAL PROCESSING, 1980, 2 (02) : 113 - 129