Auditory-based Formant Estimation in Noise using a Probabilistic Framework

被引:0
|
作者
Glaeser, Claudius [1 ]
Heckmann, Martin [1 ]
Joublin, Frank [1 ]
Goerick, Christian [1 ]
机构
[1] Honda Res Inst Europe, D-63073 Offenbach, Germany
关键词
speech processing; formant extraction; tracking; robustness; Bayes procedures;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We recently introduced a computationally efficient framework for tracking formants which combines a biologically inspired preprocessing for enhancing formants in spectrograms with a probabilistic framework for estimating formant trajectories. In contrast to previously published approaches our tracking scheme relies on the joint distribution of formants rather than using independent tracking instances for each formant separately. Therewith more precise formant estimates could be obtained. In this paper we will briefly review our algorithm and extend it by using more sophisticated models of the formants underlying dynamics. Furthermore, we will dwell on the robustness of our method for speech degraded by various types of noise. A comprehensive evaluation on a large publicly available database containing hand-labeled formant trajectories shows significant performance improvements in both clean and noisy speech compared to state of the art approaches.
引用
收藏
页码:2606 / 2609
页数:4
相关论文
共 50 条
  • [31] Respiratory sound classification utilizing human auditory-based feature extraction
    Rishabh, Dhirendra
    Kumar, Dhirendra
    Meena, Yogendra
    Singh, Kuldeep
    PHYSICA SCRIPTA, 2025, 100 (04)
  • [32] Auditory-based distortion measure with application to concatenative speech synthesis
    Duke Univ, Durham, United States
    IEEE Trans Speech Audio Process, 5 (489-495):
  • [33] Partial maintenance of auditory-based cognitive training benefits in older adults
    Anderson, Samira
    White-Schwoch, Travis
    Choi, Hee Jae
    Kraus, Nina
    NEUROPSYCHOLOGIA, 2014, 62 : 286 - 296
  • [34] Matlab-based Formant Estimation
    Luo Jiao-yan
    Sun Xiang-e
    Applied Decisions in Area of Mechanical Engineering and Industrial Manufacturing, 2014, 577 : 798 - 801
  • [35] Selective attention in an overcrowded auditory scene: Implications for auditory-based brain-computer interface design
    Maddox, Ross K.
    Cheung, Willy
    Lee, Adrian K. C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (05): : EL385 - EL390
  • [36] A formant frequency estimation scheme for speech signals in the presence of noise
    Fattah, S. A.
    Zhu, W. -P.
    Ahmad, M. O.
    2007 INTERNATIONAL SYMPOSIUM ON SIGNALS, SYSTEMS AND ELECTRONICS, VOLS 1 AND 2, 2007, : 393 - 396
  • [37] An Approach for Formant Based Speech Recognition in Noise
    Fattah, Shaikh Anowarul
    Ghosh, Tonmoy
    Das, Apurba Kumar
    Goswami, Rajib
    Shafin, Abu
    Jameel, Mohammad Mahdee
    Shahnaz, Celia
    TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
  • [38] Probabilistic Compression Artifacts Reduction Using Self-Similarity Based Noise Region Estimation
    Lee, Oh-Young
    Ryu, Je-Ho
    Kim, Jong-Ok
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 784 - 788
  • [39] FORMANT ESTIMATION ALGORITHM BASED ON POLE FOCUSING OFFERING IMPROVED NOISE TOLERANCE AND FEATURE RESOLUTION
    DUNCAN, G
    JACK, MA
    IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1988, 135 (01) : 18 - 32
  • [40] Noise Robust Formant Frequency Estimation Method Based on Spectral Model of Repeated Autocorrelation of Speech
    Jameel, Abu Shafin Mohammad Mahdee
    Fattah, Shaikh Anowarul
    Goswami, Rajib
    Zhu, Wei-Ping
    Ahmad, M. Omair
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1357 - 1370