Auditory-based Formant Estimation in Noise using a Probabilistic Framework

Cited by: 0
Authors
Glaeser, Claudius [1]
Heckmann, Martin [1]
Joublin, Frank [1]
Goerick, Christian [1]
Affiliations
[1] Honda Res Inst Europe, D-63073 Offenbach, Germany
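Keywords
speech processing; formant extraction; tracking; robustness; Bayes procedures
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract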
We recently introduced a computationally efficient framework for tracking formants that combines biologically inspired preprocessing, which enhances formants in spectrograms, with a probabilistic framework for estimating formant trajectories. In contrast to previously published approaches, our tracking scheme relies on the joint distribution of the formants rather than on independent tracking instances for each formant. This yielded more precise formant estimates. In this paper we briefly review our algorithm and extend it with more sophisticated models of the formants' underlying dynamics. Furthermore, we examine the robustness of our method for speech degraded by various types of noise. A comprehensive evaluation on a large, publicly available database containing hand-labeled formant trajectories shows significant performance improvements on both clean and noisy speech compared to state-of-the-art approaches.
Pages: 2606-2609
Number of pages: 4