Feature enhancement for a bitstream-based front-end in wireless speech recognition

被引:0
|
作者
Kim, HK [1 ]
Cox, RV [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a feature enhancement algorithm for wireless speech recognition in adverse acoustic environments. A speech recognition system is realized at the receiver side of a wireless communications system and feature parameters are extracted directly from the bitstream of the speech coder employed in the system. The feature parameters are composed of spectral envelope and coder-specific information. The proposed feature enhancement algorithm incorporates feature parameters obtained from the decoded speech and an enhanced version into the bitstream-based feature parameters. Moreover, the coder-specific parameters are improved by reestimating the codebook gains and residual energy from the enhanced residual signal. HMM-based connected digit recognition experiments show that the proposed feature enhancement algorithm significantly improves recognition accuracy at low SNR without causing poorer performance at high SNR.
引用
收藏
页码:241 / 244
页数:4
相关论文
共 50 条
  • [1] A bitstream-based front-end for wireless speech recognition on IS-136 communications system
    Kim, HK
    Cox, RV
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05): : 558 - 568
  • [2] Performance improvement of a bitstream-based front-end for wireless speech recognition in adverse environments
    Kim, HK
    Cox, RV
    Rose, RC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08): : 591 - 604
  • [3] Bitstream-based feature extraction for wireless speech recognition
    Kim, HK
    Cox, RV
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1607 - 1610
  • [4] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
  • [5] Optimization of Speech Enhancement Front-end with Speech Recognition-level Criterion
    Higuchi, Takuya
    Yoshioka, Takuya
    Nakatani, Tomohiro
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3808 - 3812
  • [6] Front-End Feature Compensation for Noise Robust Speech Emotion Recognition
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Das, Biswajit
    Kopparapu, Sunil Kumar
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [7] A Reassigned Front-End for Speech Recognition
    Tryfou, Georgina
    Omologo, Maurizio
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 553 - 557
  • [8] The speech recognition based on the bark wavelet front-end processing
    Zhang, XY
    Jiao, ZP
    Zhao, ZF
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 302 - 305
  • [9] Front-end Feature Compensation and Denoising for Noise Robust Speech Emotion Recognition
    Chakraborty, Rupayan
    Panda, Ashish
    Pandharipande, Meghna
    Joshi, Sonal
    Kopparapu, Sunil Kumar
    [J]. INTERSPEECH 2019, 2019, : 3257 - 3261
  • [10] Wavelet-based Front-End for Electromyographic Speech Recognition
    Wand, Michael
    Jou, Szu-Chen Stan
    Schultz, Tanja
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1773 - +