MAP-based perceptual modeling for noisy speech recognition

被引:0
|
作者
Institute of Biomedical Engineering, National Cheng Kung University, Tainan, 701, Taiwan [1 ]
不详 [2 ]
不详 [3 ]
不详 [4 ]
不详 [5 ]
机构
来源
J. Inf. Sci. Eng. | 2006年 / 5卷 / 999-1013期
关键词
Computer simulation - Mathematical models - Noise abatement - Spurious signal noise;
D O I
暂无
中图分类号
学科分类号
摘要
This study presents a maximum a posteriori (MAP) based perceptual modeling approach to deal with the issue of recognition degradation in noisy environment. In this approach, MAP-based noise detection is first applied to identify the noise segment in an utterance. Subtractive-type enhancement algorithm with masking properties of the human auditory system is then used to reduce the noise effect. Finally, MAP-based incremental noise model adaptation is developed to overcome the model inconsistencies between training and testing environments. For performance evaluation of the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods.
引用
收藏
相关论文
共 50 条
  • [1] MAP-based perceptual modeling for noisy speech recognition
    Sher, Yung-Ji
    Chen, Yeou-Jiunn
    Chiu, Yu-Hsien
    Chung, Kao-Chi
    Wu, Chung-Hsien
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2006, 22 (05) : 999 - 1013
  • [2] Perceptual speech modeling for noisy speech recognition
    Wu, CH
    Chiu, YH
    Lim, H
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 385 - 388
  • [3] Perceptual features for automatic speech recognition in noisy environments
    Haque, Serajul
    Togneri, Roberto
    Zaknich, Anthony
    SPEECH COMMUNICATION, 2009, 51 (01) : 58 - 75
  • [4] Perceptual harmonic cepstral coefficients for speech recognition in noisy environment
    Gu, L
    Rose, K
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 125 - 128
  • [5] Noisy speech recognition based on speech enhancement
    Wang, Xia
    Tang, Hongmei
    Zhao, Xiaoqun
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 713 - +
  • [6] Concept map-based knowledge modeling
    Coffey, JW
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, PROCEEDINGS: INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 361 - 365
  • [7] Subspace Modeling and Selection for Noisy Speech Recognition
    Chien, Jen-Tzung
    Ting, Chuan-Wei
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 789 - 792
  • [8] Noisy Chinese digit string speech recognition based on tone modeling
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    不详
    Shengxue Xuebao, 2007, 5 (454-460):
  • [9] Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition
    Tsao, Yu
    Dixon, Paul R.
    Hori, Chiori
    Kawai, Hisashi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2596 - 2599
  • [10] Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition
    Ganapathy, Sriram
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (09) : 1373 - 1377