Improving speech detection robustness for wireless speech recognition

被引:0
|
作者
Karray, L [1 ]
Mauuary, L [1 ]
机构
[1] CNET, FT, RCP, DIH, F-22307 Lannion, France
关键词
D O I
10.1109/ASRU.1997.659120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of speech recognition systems shows that noise and channel effects are very disturbing, and an efficient detection of speech/non-speech segments is necessary, Preprocessing the speech signal is one of the adopted solutions to improve recognition performance. In this paper, spectral subtraction is used as a preprocessing technique aiming to increase the robustness to noisy conditions. Results of several experiments carried out on a database collected over GSM network show that spectral subtraction improves the global recognizer performances, especially in very noisy environments. We shaw that the improvements concern mainly noise/speech detection module.
引用
收藏
页码:428 / 435
页数:8
相关论文
共 50 条
  • [21] Compensation of speech coding distortion for wireless speech recognition
    Kim, HK
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (06): : 1596 - 1600
  • [22] IMPROVING NOISE ROBUSTNESS OF CONTRASTIVE SPEECH REPRESENTATION LEARNING WITH SPEECH RECONSTRUCTION
    Wang, Heming
    Qian, Yao
    Wang, Xiaofei
    Wang, Yiming
    Wang, Chengyi
    Liu, Shujie
    Yoshioka, Takuya
    Li, Jinyu
    Wang, DeLiang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6062 - 6066
  • [23] Speech recognition for wireless applications
    Weerackody, V
    Reichl, W
    Potamianos, A
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-10, CONFERENCE RECORD, 2001, : 1047 - 1051
  • [24] Dual-TBNet: Improving the Robustness of Speech Features via Dual-Transformer-BiLSTM for Speech Emotion Recognition
    Liu, Zheng
    Kang, Xin
    Ren, Fuji
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2193 - 2203
  • [25] Distorted speech rejection for automatic speech recognition in wireless communication
    Chang, JH
    Kim, NS
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (07) : 1978 - 1981
  • [26] Quantifying and Improving the Performance of Speech Recognition Systems on Dysphonic Speech
    Lopez, Julio C. Hidalgo C.
    Sandeep, Shelly
    Wright, MaKayla
    Wandell, Grace M. M.
    Law, Anthony B. B.
    [J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2023, 168 (05) : 1130 - 1138
  • [27] An Adaptive Speech Speed Algorithm for Improving Continuous Speech Recognition
    Zhu, Jinwei
    Chen, Huan
    Wen, Xing
    Huang, Zhenlin
    Zhao, Liuqi
    [J]. ACM International Conference Proceeding Series, 2023, : 606 - 610
  • [28] THE DEREVERBERATION OF SPEECH WITH A VIEW TO IMPROVING THE AUTOMATIC RECOGNITION OF SPEECH IN ROOMS
    HIRSCH, HG
    [J]. ACUSTICA, 1989, 67 (03): : 216 - 221
  • [29] IMPROVING SPEECH RECOGNITION USING CONSISTENT PREDICTIONS ON SYNTHESIZED SPEECH
    Wang, Gary
    Rosenberg, Andrew
    Chen, Zhehuai
    Zhang, Yu
    Ramabhadran, Bhuvana
    Wu, Yonghui
    Moreno, Pedro
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7029 - 7033
  • [30] Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
    Oneata, Dan
    Cucu, Horia
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4578 - 4587