Morphological filtering of spectrograms for automatic speech recognition

Authors
Liu, WM [1]
Bastante, VJR [1]
Rodriguez, FR [1]
Evans, NWD [1]
Mason, JSD [1]
Affiliations
[1] Univ Coll Swansea, Sch Engn, Swansea, W Glam, Wales
Keywords
ASR (automatic speech recognition); segmentation; morphological filtering;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper examines the separation of speech signals from additive noise using a recently proposed signal-noise segmentation approach based on statistical properties of the spectrogram [1,2]. Competitive ASR results were reported in [3] despite using only crude spectrogram shape information, suggesting that the approach reliably identifies regions of different signal dominance and might be robust down to negative SNRs. This paper extends these early results in two directions. The first extension investigates the contribution of spectrogram shapes plus magnitudes versus shapes alone: the same ASR experiments as in [3] are repeated, but this time with magnitude information recovered in regions deemed to contain speech. Results show consistent improvement for all SNRs down to -5 dB. The second extension relates to computational efficiency: a modified one-pass version of the originally iterative process is proposed by empirically deducing an optimal final stopping condition for each SNR. This is found to reduce computation time significantly (by factors ranging from 7 to 18) whilst improving ASR accuracy.
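The distinction between "shapes alone" and "shapes plus magnitudes" can be illustrated with a toy example. The sketch below is not the authors' method: the abstract does not specify the statistical segmentation of [1,2] or the stopping condition, so a global threshold, a 3x3 structuring element, and a single morphological opening are all assumptions made purely for illustration, as are the function names and the toy spectrogram values.

```python
def threshold_mask(spec, thresh):
    """Binary shape mask: 1 where magnitude exceeds `thresh` (assumed global threshold)."""
    return [[1 if v > thresh else 0 for v in row] for row in spec]


def _morph(mask, op):
    """Apply `op` over each cell's 3x3 neighbourhood (min = erosion, max = dilation).

    Cells outside the array are simply ignored, which is an assumed border policy.
    """
    rows, cols = len(mask), len(mask[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            neigh = [
                mask[rr][cc]
                for rr in range(r - 1, r + 2)
                for cc in range(c - 1, c + 2)
                if 0 <= rr < rows and 0 <= cc < cols
            ]
            out[r][c] = op(neigh)
    return out


def opening(mask):
    """Morphological opening: erosion followed by dilation; removes small isolated regions."""
    return _morph(_morph(mask, min), max)


def recover_magnitudes(spec, mask, floor=0):
    """Shapes plus magnitudes: keep original values inside the mask, `floor` elsewhere."""
    return [[v if m else floor for v, m in zip(srow, mrow)]
            for srow, mrow in zip(spec, mask)]


# Toy 6x7 "spectrogram" (hypothetical values): background 1,
# a 3x3 speech-like block of 9s, and one isolated noise spike of 8.
spec = [[1] * 7 for _ in range(6)]
for r in range(1, 4):
    for c in range(1, 4):
        spec[r][c] = 9
spec[4][5] = 8

raw = threshold_mask(spec, thresh=5)       # shape mask, still contains the spike
opened = opening(raw)                      # shape mask after morphological filtering
restored = recover_magnitudes(spec, opened)  # magnitudes recovered in masked regions
```

Here `opened` plays the role of the shape-only representation, while `restored` adds the original magnitudes back in the regions the mask marks as speech. The opening removes the single-cell spike because no 3x3 neighbourhood around it is fully set, while the 3x3 block survives.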
Pages: 546-549
Number of pages: 4