Noise Robust Formant Frequency Estimation Method Based on Spectral Model of Repeated Autocorrelation of Speech

被引：7

作者：

Jameel, Abu Shafin Mohammad Mahdee ^{[1
]}

Fattah, Shaikh Anowarul ^{[2
]}

Goswami, Rajib ^{[3
]}

Zhu, Wei-Ping ^{[4
]}

Ahmad, M. Omair ^{[4
]}

机构：

[1] Rensselaer Polytech Inst, Dept Elect Comp & Syst Engn, Troy, NY 12180 USA

[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh

[3] Univ Texas San Antonio, Dept Elect & Comp Engn, San Antonio, TX 78249 USA

[4] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2017年 / 25卷 / 06期

关键词：

Autocorrelation; formant estimation; repeated autocorrelation; speech analysis; spectrum; spectral model; TRACKING; REPRESENTATION;

D O I：

10.1109/TASLP.2016.2625423

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a noise robust formant frequency estimation scheme is developed based on a spectral model matching algorithm. Considering the vocal tract as an autoregressive system, a spectral model of repeated autocorrelation function (RACF) of band-limited speech signal is proposed. It is shown that because of the repeated autocorrelation operation on band-limited signal, the proposed model can exhibit prominent formant characteristics. First from given noisy speech observations, an adaptive band selection criterion is developed. Next, on each resulting band-limited noisy speech signal, a repeated autocorrelation operation is carried out, which not only reduces the effect of noise but also strengthens the dominant poles corresponding to the formant frequencies. Finally, spectrum of the RACF is computed and instead of direct spectral peak picking, a model fitting scheme is introduced to find out model parameters which lead to formant estimation. The proposed algorithm has been tested on natural vowels as well as some naturally spoken sentences in the presence of different environmental noises. It is found that the proposed scheme provides better formant estimation accuracy in comparison to some of the existing methods at low levels of signal-to-noise ratio.

引用

页码：1357 / 1370

页数：14

共 50 条

[21] A robust algorithm for formant frequency extraction of noisy speech
Zhao, QF
Shimamura, T
Suzuki, J
[J]. ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D534 - D537
[22] HMM-Based Estimation of Unreliable Spectral Components for Noise Robust Speech Recognition
Borgstroem, Bengt J.
Alwan, Abeer
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1769 - 1772
[23] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
Shen, Guanghu
Jung, Ho-Youl
Chung, Hyun-Yeol
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
[24] An Approach for Formant Based Speech Recognition in Noise
Fattah, Shaikh Anowarul
Ghosh, Tonmoy
Das, Apurba Kumar
Goswami, Rajib
Shafin, Abu
Jameel, Mohammad Mahdee
Shahnaz, Celia
[J]. TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
[25] Features based on filtering and spectral peaks in autocorrelation domain for robust speech recognition
Farahani, G.
Ahadi, S. M.
Homayounpour, M. M.
[J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (01): : 187 - 205
[26] Method of Noise-Robust Estimation of Parameters of an Autoregressive Model in the Frequency Domain
Zadiraka, V. K.
Semenov, V. Yu.
Semenova, Ye. V.
[J]. CYBERNETICS AND SYSTEMS ANALYSIS, 2021, 57 (05) : 836 - 842
[27] Method of Noise-Robust Estimation of Parameters of an Autoregressive Model in the Frequency Domain
V. K. Zadiraka
V. Yu. Semenov
Ye. V. Semenova
[J]. Cybernetics and Systems Analysis, 2021, 57 : 836 - 842
[28] FORMANT CONTOUR ESTIMATION IN NOISE - COMPARISON OF A NEW ZERO-CROSSING BASED METHOD WITH OTHER SPECTRAL ESTIMATORS
SREENIVAS, TV
NIEDERJOHN, RJ
[J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS ENGINEERING ///, 1989, : 399 - 402
[29] Speech formant frequency and pitch estimation using instantaneous complex frequency
Kaniewska, Magdalena
[J]. ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 493 - 496
[30] Speech Enhancement Based on Spectral Estimation from Higher-lag Autocorrelation
Shannon, Benjamin J.
Paliwal, Kuldip K.
Nadeu, Climent
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1427 - 1430

← 1 2 3 4 5 →