Application of Inverse Filtering in Enhancement of Whisper Recognition

被引:0
|
作者
Grozdic, Dorde T. [1 ,2 ]
Jovicic, Slobodan T. [1 ,2 ]
Galic, Jovan [3 ]
Markovic, Branko [4 ]
机构
[1] Univ Belgrade, Sch Elect Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
[2] Life Act Adv Ctr, Lab Forens Acoust & Phonet, Belgrade 11000, Serbia
[3] Univ Banja Luka, Fac Elect Engn, Banja Luka, Bosnia & Herceg
[4] Cacak Tech Coll, Cacak, Serbia
关键词
ANN; Inverse filtering; MPL; Speech recognition; Whisper;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The differences between normal speech and whisper, particularly in terms of their acoustic characteristics, are serious problem of ASR (Automatic Speech Recognition) systems. This paper presents the preliminary results of the new way of speech signal pre-processing, which is based on inverse filtering. This method of signal pre-processing improves whisper recognition with ANNs (Artificial Neural Networks). The ANNs showed high capabilities in speech and whisper recognition in matched train/test scenarios, with the average recognition accuracy of 99.8%. However, the recognition scores in mismatched train/test scenarios were highly degraded. Because of their practical significance, the mismatched train/test scenarios were analyzed in detail in this research. Particularly, the speech/whisper scenario is important. This scenario corresponds to real life situation when speaker is in front of ASR system and from speech switches to whisper. The use of inverse filter enhanced whisper recognition by 9.48%, which in this scenario amounts 70.25%.
引用
收藏
页码:157 / 161
页数:5
相关论文
共 50 条
  • [41] Application of spatio-temporal filtering to fetal electrocardiogram enhancement
    Kotas, M.
    Jezewski, J.
    Horoba, K.
    Matonia, A.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2011, 104 (01) : 1 - 9
  • [42] A kepstrum approach to filtering, smoothing and prediction with application to speech enhancement
    Moir, TJ
    Barrett, JF
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2003, 459 (2040): : 2957 - 2976
  • [43] Application of Perceptual Filtering Models to Noisy Speech Signals Enhancement
    Zoghlami, Novlene
    Lachiri, Zied
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2012, 2012
  • [44] Speech enhancement using an equivalent source inverse filtering-based microphone array
    Bai, Mingsian R.
    Hur, Kur-Nan
    Liu, Ying-Ting
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (03): : 1373 - 1380
  • [45] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Huan-Yu Dong
    Chang-Myung Lee
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [46] SEISMIC SIGNAL ENHANCEMENT BY A NEW INVERSE ARRAY FILTERING TECHNIQUE IN OIL-EXPLORATION
    SARMA, GN
    AYACHE, S
    GEOPHYSICS, 1979, 44 (03) : 348 - 349
  • [47] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Dong, Huan-Yu
    Lee, Chang-Myung
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [48] FILTERING INVERSE METHOD
    WANG, TW
    XUE, XH
    LIU, RX
    INVERSE PROBLEMS, 1987, 3 (01) : 143 - 148
  • [49] NONLINEAR INVERSE FILTERING
    ROTHENBE.M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 294 - &
  • [50] Inverse filtering and deconvolution
    Saberi, A
    Stoorvogel, AA
    Sannuti, P
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2001, 11 (02) : 131 - 156