Robust speech recognition with selective input data to a NN classifier

被引:0
|
作者
Cong, L [1 ]
Asghar, S [1 ]
机构
[1] Adv Micro Devices, Austin, TX 78741 USA
关键词
neural networks; speech recognition;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the new robust IWSR systems which provide selective input data to a neural network in response to speech signal to acoustic noise ratios to improve speech recognition system performance The proposed robust isolated word speech recognition systems employ FMQ/MQ as the spectral labelling process, followed by a Hidden Markov Model (HMM), or a HMM and Neural Network (HMM/MLP) classification techniqut. NOSEX92 has been used as the speech and noise database. This robust performance ensures that a recognition accuracy of the order 98.9% and 95% is obtained with SNR at 10 dB and 0 dB respectively.
引用
收藏
页码:1817 / 1824
页数:8
相关论文
共 50 条
  • [1] Speech recognition using hybrid hidden Markov model and NN classifier
    Kundu, A.
    Bayya, A.
    [J]. International Journal of Speech Technology, 1998, 2 (03): : 227 - 240
  • [2] Speech recognition using hybrid hidden Markov model and NN classifier
    Kundu A.
    Bayya A.
    [J]. International Journal of Speech Technology, 1998, 2 (3) : 227 - 240
  • [3] Noise-robust speech recognition using top-down selective attention with an HMM classifier
    Lee, Chang-Hoon
    Lee, Soo-Young
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 489 - 491
  • [4] AN IMPROVED METHOD IN SPEECH SIGNAL INPUT REPRESENTATION BASED ON DTW TECHNIQUE FOR NN SPEECH RECOGNITION SYSTEM
    Sudirman, Rubita
    Salleh, Sh. Hussain
    Salleh, Shaharuddin
    [J]. JURNAL TEKNOLOGI, 2007, 46
  • [5] A STUDY ON DATA AUGMENTATION OF REVERBERANT SPEECH FOR ROBUST SPEECH RECOGNITION
    Ko, Tom
    Peddinti, Vijayaditya
    Povey, Daniel
    Seltzer, Michael L.
    Khudanpur, Sanjeev
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5220 - 5224
  • [6] Missing data techniques for robust speech recognition
    Cooke, M
    Morris, A
    Green, P
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 863 - 866
  • [7] Speech recognition for data input and control systems in agriculture
    Hirschfeld, D
    Boettinger, S
    Scharlau, C
    [J]. CONFERENCE: AGRICULTURAL ENGINEERING 2004, 2004, 1855 : 151 - 158
  • [8] Classifier combination in speech recognition
    Felföldi, László
    Kocsor, András
    Tóth, Lászlo
    [J]. Periodica Polytechnica Electrical Engineering, 2003, 47 (1-2): : 125 - 140
  • [9] Robust Submodular Data Partitioning for Distributed Speech Recognition
    Qi, Jun
    Tejedor, Javier
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2254 - 2258
  • [10] EXPLOITING MULTIMODAL DATA FUSION IN ROBUST SPEECH RECOGNITION
    Heracleous, Panikos
    Badin, Pierre
    Bailly, Gerard
    Hagita, Norihiro
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 568 - 572