An Approach for Formant Based Speech Recognition in Noise

被引:0
|
作者
Fattah, Shaikh Anowarul [1 ]
Ghosh, Tonmoy [1 ]
Das, Apurba Kumar [1 ]
Goswami, Rajib [1 ]
Shafin, Abu [1 ]
Jameel, Mohammad Mahdee [1 ]
Shahnaz, Celia [1 ]
机构
[1] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
关键词
Formant estimation; noise; higher order Yule-Walker equations; speech analysis; vowel recognition; TRACKING;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a noise robust formant frequency estimation scheme is developed utilizing the advantageous properties of the autocorrelation function of the band-limited noisy speech signal. It is shown that the use of autocorrelation operation on a speech signal, which is band-limited to a particular formant zone, in comparison to one without any band limitation, can provide higher noise immunity, especially under severe noisy condition. In order to extract each formant, a modified higher order Yule-Walker method is employed on the resulting autocorrelation sequence. Within a band, the pole with the maximum energy is selected as the formant. The estimated formants are used as features along with conventional Mel frequency cepstral coefficients in a vowel recognition system, where the linear discriminant based classifier is utilized. Extensive experimentation is carried out on speech samples taken from the TIMIT standard speech database. It is found that the proposed algorithm provides superior formant estimation accuracy in comparison to that obtained by some of the state of the art methods even at a very low level of signal-to-noise ratio (SNR) for both male and female speakers. Moreover, formant estimates obtained by the proposed method can also provide better vowel recognition accuracy in the presence of significant background noise.
引用
收藏
页数:4
相关论文
共 50 条
  • [11] A perceptual masking approach for noise robust speech recognition
    Hari Krishna Maganti
    Marco Matassoni
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [12] Robust speech recognition using a noise rejection approach
    Khan, E
    Levinson, R
    IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 326 - 335
  • [13] Incorporating formant cues into distributed speech recognition systems
    Norouzian, Atta
    Selouani, Sid--Ahmed
    Tolba, Hesham
    O'Shaughnessy, Douglas
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 1159 - +
  • [14] A perceptual masking approach for noise robust speech recognition
    Maganti, Hari Krishna
    Matassoni, Marco
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [15] Noise Speech Recognition Based on Compressive Sensing
    Zhao Zhi-peng
    Cen Yi-gang
    Chen Xiao-fang
    COMPUTATIONAL MATERIALS SCIENCE, PTS 1-3, 2011, 268-270 : 82 - +
  • [16] Correlation based speech formant recovery
    Nelson, D
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1643 - 1646
  • [17] Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence
    Liu, Zhen-Tao
    Rehman, Abdul
    Wu, Min
    Cao, Wei-Hua
    Hao, Man
    Information Sciences, 2021, 563 : 309 - 325
  • [18] AN MCMC APPROACH TO JOINT ESTIMATION OF CLEAN SPEECH AND NOISE FOR ROBUST SPEECH RECOGNITION
    Mushtaq, Aleem
    Lee, Chin-Hui
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7107 - 7111
  • [19] FORMANT SHIFTING FOR SPEECH INTELLIGIBILITY IMPROVEMENT IN CAR NOISE ENVIRONMENT
    Nathwani, Karan
    Daniel, Morgane
    Richard, Gael
    David, Bertrand
    Roussarie, Vincent
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5375 - 5379
  • [20] A formant frequency estimation scheme for speech signals in the presence of noise
    Fattah, S. A.
    Zhu, W. -P.
    Ahmad, M. O.
    2007 INTERNATIONAL SYMPOSIUM ON SIGNALS, SYSTEMS AND ELECTRONICS, VOLS 1 AND 2, 2007, : 393 - 396