Multi-band speech recognition in noisy environments

Cited by: 0
Authors
Okawa, S [1]
Bocchieri, E [1]
Potamianos, A [1]
Affiliation
[1] AT&T Bell Labs, Res, Florham Park, NJ 07932 USA
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a new approach to multi-band automatic speech recognition (ASR). Recent work by Bourlard and Hermansky suggests that multi-band ASR, which combines the likelihoods of different frequency bands, gives more accurate recognition, especially in noisy acoustic environments. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR and propose an alternative method, feature recombination (FC). In the FC system, a separate acoustic analyzer is applied to each sub-band, and the resulting sub-band features are combined into a single vector, from which the speech classifier computes the likelihood. Thus, band-limited noise affects only a few of the feature components, as in the multi-band LC system, while all feature components are jointly modeled, as in conventional ASR. The experimental results show that, for noisy speech, the FC system can yield better performance than both conventional ASR and the LC strategy.
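The contrast between the two recombination strategies can be illustrated with a minimal sketch. The abstract does not specify the acoustic analyzer or the classifier, so everything concrete here is an assumption made for illustration: per-band cepstra (a truncated DCT of the band's log spectrum) stand in for the sub-band features, Gaussians stand in for the classifier, and all function names are hypothetical.

```python
import numpy as np


def band_cepstra(log_band, n_ceps):
    """Truncated DCT-II of one sub-band's log spectrum (assumed analyzer)."""
    n = len(log_band)
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (m + 0.5) / n)
    return basis @ log_band


def subband_features(log_spectrum, n_bands=4, n_ceps=3):
    """Analyze each sub-band independently; returns one feature vector per band."""
    bands = np.array_split(log_spectrum, n_bands)
    return [band_cepstra(b, n_ceps) for b in bands]


def fc_vector(log_spectrum, n_bands=4, n_ceps=3):
    """Feature recombination: concatenate the sub-band features into one vector."""
    return np.concatenate(subband_features(log_spectrum, n_bands, n_ceps))


def diag_gauss_loglik(x, mean, var):
    """Log-likelihood under a diagonal-covariance Gaussian."""
    return float(-0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var))


def full_gauss_loglik(x, mean, cov):
    """Log-likelihood under a full-covariance Gaussian (joint model)."""
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)
    quad = d @ np.linalg.solve(cov, d)
    return float(-0.5 * (len(x) * np.log(2 * np.pi) + logdet + quad))


def lc_score(log_spectrum, band_models, weights=None, n_bands=4, n_ceps=3):
    """Likelihood recombination: weighted sum of per-band log-likelihoods.

    band_models is a list of (mean, var) pairs, one per band (assumed form).
    """
    feats = subband_features(log_spectrum, n_bands, n_ceps)
    w = np.ones(n_bands) if weights is None else np.asarray(weights, dtype=float)
    scores = [diag_gauss_loglik(f, mu, var) for f, (mu, var) in zip(feats, band_models)]
    return float(np.dot(w, scores))


def fc_score(log_spectrum, mean, cov, n_bands=4, n_ceps=3):
    """Feature recombination: a single joint likelihood over the combined vector."""
    return full_gauss_loglik(fc_vector(log_spectrum, n_bands, n_ceps), mean, cov)


# Band-limited noise: corrupt only the lowest quarter of a toy log spectrum.
clean = np.linspace(0.0, 1.0, 32)
noisy = clean.copy()
noisy[:8] += 5.0
changed = ~np.isclose(fc_vector(clean), fc_vector(noisy))
print("FC components affected by the noise:", np.flatnonzero(changed))
```

In this toy setup the noise perturbs only components belonging to the first sub-band of the combined vector, which is the localization property the abstract attributes to both systems. The difference lies in the scoring: `lc_score` treats the bands as independent and sums their log-likelihoods, whereas `fc_score` can use a covariance with off-diagonal terms that couple features across bands.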
Pages: 641-644
Number of pages: 4