Application of Inverse Filtering in Enhancement of Whisper Recognition

Cited by: 0
Authors
Grozdic, Dorde T. [1 ,2 ]
Jovicic, Slobodan T. [1 ,2 ]
Galic, Jovan [3 ]
Markovic, Branko [4 ]
Affiliations
[1] Univ Belgrade, Sch Elect Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
[2] Life Act Adv Ctr, Lab Forens Acoust & Phonet, Belgrade 11000, Serbia
[3] Univ Banja Luka, Fac Elect Engn, Banja Luka, Bosnia & Herceg
[4] Cacak Tech Coll, Cacak, Serbia
Keywords
ANN; Inverse filtering; MLP; Speech recognition; Whisper;
DOI
Not available
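Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes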
0808 ; 0809 ;
Abstract
The acoustic differences between normal speech and whisper pose a serious problem for ASR (Automatic Speech Recognition) systems. This paper presents preliminary results of a new speech signal pre-processing method based on inverse filtering, which improves whisper recognition with ANNs (Artificial Neural Networks). The ANNs showed high capability in speech and whisper recognition in matched train/test scenarios, with an average recognition accuracy of 99.8%. However, recognition scores in mismatched train/test scenarios were severely degraded. Because of their practical significance, the mismatched train/test scenarios were analyzed in detail in this research. The speech/whisper scenario is particularly important, since it corresponds to the real-life situation in which a speaker in front of an ASR system switches from speech to whisper. The use of the inverse filter improved whisper recognition by 9.48%, to 70.25% in this scenario.
Pages: 157-161
Page count: 5
Related papers
50 in total
  • [1] Whisper Speech Enhancement Using Joint Variational Autoencoder for Improved Speech Recognition
    Agrawal, Vikas
    Kumar, Shashi
    Rath, Shakti P.
    INTERSPEECH 2021, 2021, : 2706 - 2710
  • [2] Application of inverse filtering on lidar signals
    Stockhausen, N
    Werner, C
    Streicher, J
    Klier, M
    LASER RADAR RANGING AND ATMOSPHERIC LIDAR TECHNIQUES II, 1999, 3865 : 134 - 143
  • [3] APPLICATION OF INVERSE FILTERING FOR DETECTING LARYNGEAL PATHOLOGY
    KOIKE, Y
    MARKEL, J
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 1975, 84 (01): : 117 - 124
  • [4] Application of Combined Filtering in Thunder Recognition
    Wang, Yao
    Yang, Jing
    Zhang, Qilin
    Zeng, Jinquan
    Mu, Boyi
    Du, Junzhi
    Li, Zhekai
    Shao, Yuhui
    Wang, Jialei
    Li, Zhouxin
    REMOTE SENSING, 2023, 15 (02)
  • [5] Application of spatial filtering to speech recognition
    THORNE DL
    National Aerospace and Electronics Conference, Proceedings of the IEEE, 1972, : 265 - 269
  • [6] Application of vector filtering to pattern recognition
    Margin-Chagnolleau, I
    Durou, G
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 433 - 436
  • [8] An End to End Method of Whisper Enhancement
    Huang, Yan
    Lian, HaiLun
    Zhou, Jian
    Wang, HuaBin
    Tao, Liang
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 246 - 250
  • [9] ENHANCEMENT OF WORD-RECOGNITION PERFORMANCE WITH A FILTERING TECHNIQUE
    WILSON, RH
    PREECE, JP
    CROWTHER, CS
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1991, 34 (06): : 1436 - 1438
  • [10] DETECTION AND CALIBRATION OF WHISPER FOR SPEAKER RECOGNITION
    Kelly, Finnian
    Hansen, John H. L.
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1060 - 1065