Noise Robust Formant Frequency Estimation Method Based on Spectral Model of Repeated Autocorrelation of Speech

被引:7
|
作者
Jameel, Abu Shafin Mohammad Mahdee [1 ]
Fattah, Shaikh Anowarul [2 ]
Goswami, Rajib [3 ]
Zhu, Wei-Ping [4 ]
Ahmad, M. Omair [4 ]
机构
[1] Rensselaer Polytech Inst, Dept Elect Comp & Syst Engn, Troy, NY 12180 USA
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
[3] Univ Texas San Antonio, Dept Elect & Comp Engn, San Antonio, TX 78249 USA
[4] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
关键词
Autocorrelation; formant estimation; repeated autocorrelation; speech analysis; spectrum; spectral model; TRACKING; REPRESENTATION;
D O I
10.1109/TASLP.2016.2625423
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a noise robust formant frequency estimation scheme is developed based on a spectral model matching algorithm. Considering the vocal tract as an autoregressive system, a spectral model of repeated autocorrelation function (RACF) of band-limited speech signal is proposed. It is shown that because of the repeated autocorrelation operation on band-limited signal, the proposed model can exhibit prominent formant characteristics. First from given noisy speech observations, an adaptive band selection criterion is developed. Next, on each resulting band-limited noisy speech signal, a repeated autocorrelation operation is carried out, which not only reduces the effect of noise but also strengthens the dominant poles corresponding to the formant frequencies. Finally, spectrum of the RACF is computed and instead of direct spectral peak picking, a model fitting scheme is introduced to find out model parameters which lead to formant estimation. The proposed algorithm has been tested on natural vowels as well as some naturally spoken sentences in the presence of different environmental noises. It is found that the proposed scheme provides better formant estimation accuracy in comparison to some of the existing methods at low levels of signal-to-noise ratio.
引用
收藏
页码:1357 / 1370
页数:14
相关论文
共 50 条
  • [21] A robust algorithm for formant frequency extraction of noisy speech
    Zhao, QF
    Shimamura, T
    Suzuki, J
    [J]. ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D534 - D537
  • [22] HMM-Based Estimation of Unreliable Spectral Components for Noise Robust Speech Recognition
    Borgstroem, Bengt J.
    Alwan, Abeer
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1769 - 1772
  • [23] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
    Shen, Guanghu
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
  • [24] An Approach for Formant Based Speech Recognition in Noise
    Fattah, Shaikh Anowarul
    Ghosh, Tonmoy
    Das, Apurba Kumar
    Goswami, Rajib
    Shafin, Abu
    Jameel, Mohammad Mahdee
    Shahnaz, Celia
    [J]. TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
  • [25] Features based on filtering and spectral peaks in autocorrelation domain for robust speech recognition
    Farahani, G.
    Ahadi, S. M.
    Homayounpour, M. M.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (01): : 187 - 205
  • [26] Method of Noise-Robust Estimation of Parameters of an Autoregressive Model in the Frequency Domain
    Zadiraka, V. K.
    Semenov, V. Yu.
    Semenova, Ye. V.
    [J]. CYBERNETICS AND SYSTEMS ANALYSIS, 2021, 57 (05) : 836 - 842
  • [27] Method of Noise-Robust Estimation of Parameters of an Autoregressive Model in the Frequency Domain
    V. K. Zadiraka
    V. Yu. Semenov
    Ye. V. Semenova
    [J]. Cybernetics and Systems Analysis, 2021, 57 : 836 - 842
  • [28] FORMANT CONTOUR ESTIMATION IN NOISE - COMPARISON OF A NEW ZERO-CROSSING BASED METHOD WITH OTHER SPECTRAL ESTIMATORS
    SREENIVAS, TV
    NIEDERJOHN, RJ
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS ENGINEERING ///, 1989, : 399 - 402
  • [29] Speech formant frequency and pitch estimation using instantaneous complex frequency
    Kaniewska, Magdalena
    [J]. ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 493 - 496
  • [30] Speech Enhancement Based on Spectral Estimation from Higher-lag Autocorrelation
    Shannon, Benjamin J.
    Paliwal, Kuldip K.
    Nadeu, Climent
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1427 - 1430