Comparing NN paradigms in hybrid NN/HMM speech recognition using tied posteriors

被引:0
|
作者
Stadermann, J [1 ]
Rigoll, G [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-80290 Munich, Germany
关键词
D O I
10.1109/ASRU.2003.1318409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hybrid NN/HAM acoustic modeling is nowadays an established alternative approach in automatic speech recognition technology. A comparison of feed-forward and recurrent neural network topologies integrated in the tied-posteriors framework is presented. We give some insights in the training process of the networks estimating class posterior probabilities and show how the net's quality can be determined by introducing a new measurement prior to evaluating the complete ASR system. Finally we demonstrate the flexibility of the tied-posteriors framework by showing results for different context independent and context dependent acoustic models all based on the same system structure.
引用
收藏
页码:89 / 93
页数:5
相关论文
共 50 条
  • [1] Tied posteriors: An approach for effective introduction of context dependency in hybrid NN/HMM LVCSR
    Rottland, J
    Rigoll, G
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1241 - 1244
  • [2] HMM/NN hybrids for continuous speech recognition
    Alim, OAA
    Elboghdadly, N
    El Shaar, NM
    PROCEEDINGS OF THE EIGHTEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2001, : 509 - 516
  • [3] Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition
    Stadermann, Jan
    Rigoll, Gerhard
    SPEECH COMMUNICATION, 2006, 48 (08) : 1037 - 1046
  • [4] Hybrid HMM-NN for speech recognition and prior class probabilities
    Albesano, D
    Gemello, R
    Mana, F
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2391 - 2395
  • [5] Comparing Hybrid NN-HMM and RNN for Temporal Modeling in Gesture Recognition
    Granger, Nicolas
    el Yacoubi, Mounim A.
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 147 - 156
  • [6] Applying Batch Normalization to Hybrid NN-HMM Model For Speech Recognition
    Zhan, Hongjian
    Chen, Guilin
    Lu, Yue
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 427 - 435
  • [7] A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction
    Rigoll, G
    Willett, D
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 9 - 12
  • [8] LEREC - A NN/HMM HYBRID FOR ONLINE HANDWRITING RECOGNITION
    BENGIO, Y
    LECUN, Y
    NOHL, C
    BURGES, C
    NEURAL COMPUTATION, 1995, 7 (06) : 1289 - 1303
  • [9] Hybrid NN/HMM-based speech recognition with a discriminant neural feature extraction
    Willett, D
    Rigoll, G
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 763 - 769
  • [10] On a Hybrid NN/HMM Speech Recognition System with a RNN-Based Language Model
    Soutner, Daniel
    Zelinka, Jan
    Mueller, Ludek
    SPEECH AND COMPUTER, 2014, 8773 : 315 - 321