Auditory-based Formant Estimation in Noise using a Probabilistic Framework


Authors:

Glaeser, Claudius [1]

Heckmann, Martin [1]

Joublin, Frank [1]

Goerick, Christian [1]

Affiliation:

[1] Honda Res Inst Europe, D-63073 Offenbach, Germany

Source:

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008

Keywords:

speech processing; formant extraction; tracking; robustness; Bayes procedures

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We recently introduced a computationally efficient framework for tracking formants that combines biologically inspired preprocessing, which enhances formants in spectrograms, with a probabilistic framework for estimating formant trajectories. In contrast to previously published approaches, our tracking scheme relies on the joint distribution of formants rather than on independent tracking instances for each formant. This allows more precise formant estimates to be obtained. In this paper we briefly review our algorithm and extend it with more sophisticated models of the formants' underlying dynamics. Furthermore, we examine the robustness of our method for speech degraded by various types of noise. A comprehensive evaluation on a large, publicly available database containing hand-labeled formant trajectories shows significant performance improvements in both clean and noisy speech compared to state-of-the-art approaches.


Pages: 2606-2609

Page count: 4
