A temporal auditory model with adaptation for automatic speech recognition

被引:0
|
作者
Haque, Serajul [1 ]
Togneri, Roberto [1 ]
Zaknich, Anthony [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA 6009, Australia
关键词
auditory system; speech recognition; feature extraction; adaptive system; hidden Markov model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Rapid and short-term adaptation are dynamic mechanisms of human auditory system. An auditory model based on zero-crossings with peak amplitudes (ZCPA) was used as a front-end for automatic speech recognition (ASR) with the perceptual property of adaptation as determined by psychoacoustic observations. The model performance was evaluated on the isolated digits (TIDIGITS) database using continuous density HMM recognizer in additive noise environment. Experimental results indicate that the ASR performance of the ZCPA may be improved with adaptation over the static baseline performance in white Gaussian and factory noise. The perceptual front-end was also evaluated with dynamic (delta and delta-delta) features added to the adaptation. It was observed that adaptation with dynamic features performed better in factory, babble and car noise over a wide range of SNR values.
引用
收藏
页码:1141 / +
页数:2
相关论文
共 50 条
  • [1] Automatic speech recognition with an adaptation model motivated by auditory processing
    Holmberg, M
    Gelbart, D
    Hemmert, W
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 43 - 49
  • [2] Shrinkage Model Adaptation in Automatic Speech Recognition
    Li, Jinyu
    Tsao, Yu
    Lee, Chin-Hui
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1656 - +
  • [3] Roles of computational auditory model in automatic speech recognition
    迟惠生
    吴玺宏
    [J]. Progress in Natural Science:Materials International, 2000, (12) : 9 - 18
  • [4] Roles of computational auditory model in automatic speech recognition
    Chi, HS
    Wu, XH
    [J]. PROGRESS IN NATURAL SCIENCE, 2000, 10 (12): : 887 - 896
  • [5] A model of auditory perception as front end for automatic speech recognition
    Tchorz, J
    Kollmeier, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (04): : 2040 - 2050
  • [6] Auditory modeling in automatic recognition of speech
    Hermansky, H
    [J]. SIGNAL ANALYSIS & PREDICTION I, 1997, : 17 - 22
  • [7] AUDITORY MODELING FOR AUTOMATIC SPEECH RECOGNITION
    BEET, SW
    MOORE, RK
    TOMLINSON, MJ
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 571 - 579
  • [8] Chameleon: A Language Model Adaptation Toolkit for Automatic Speech Recognition of Conversational Speech
    Song, Yuanfeng
    Jiang, Di
    Zhao, Weiwei
    Xu, Qian
    Wong, Raymond Chi-Wing
    Yang, Qiang
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2019, : 37 - 42
  • [9] NORMALIZATION AND ADAPTATION OF SPEECH DATA FOR AUTOMATIC SPEECH RECOGNITION
    SCARR, RWA
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (01): : 41 - 59
  • [10] Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations
    Pelemans, Joris
    Vanallemeersch, Tom
    Demuynck, Kris
    Van Hamme, Hugo
    Wambacq, Patrick
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2262 - 2266