Comparing NN paradigms in hybrid NN/HMM speech recognition using tied posteriors

被引:0
|
作者
Stadermann, J [1 ]
Rigoll, G [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-80290 Munich, Germany
来源
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 | 2003年
关键词
D O I
10.1109/ASRU.2003.1318409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hybrid NN/HAM acoustic modeling is nowadays an established alternative approach in automatic speech recognition technology. A comparison of feed-forward and recurrent neural network topologies integrated in the tied-posteriors framework is presented. We give some insights in the training process of the networks estimating class posterior probabilities and show how the net's quality can be determined by introducing a new measurement prior to evaluating the complete ASR system. Finally we demonstrate the flexibility of the tied-posteriors framework by showing results for different context independent and context dependent acoustic models all based on the same system structure.
引用
收藏
页码:89 / 93
页数:5
相关论文
共 50 条
  • [21] NN and hybrid strategies for speech recognition in romanian language
    Dumitru, Corneliu-Octavian
    Gavat, Inge
    ANNIP 2008: PROCEEDINGS OF THE ARTIFICIAL NEURAL NETWORKS AND INTELLIGENT INFORMATION PROCESSING, 2008, : 51 - 60
  • [22] FAST SPEAKER ADAPTATION OF HYBRID NN/HMM MODEL FOR SPEECH RECOGNITION BASED ON DISCRIMINATIVE LEARNING OF SPEAKER CODE
    Abdel-Hamid, Ossama
    Jiang, Hui
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7942 - 7946
  • [23] Probability estimation in hybrid NN-HMM speech recognition systems with real-time neural networks
    Georgescu, SM
    IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 412 - 417
  • [24] Evaluating NN and HMM classifiers for handwritten word recognition
    De Oliveira, JJ
    de Carvalho, JM
    Freitas, COD
    Sabourin, R
    SIBGRAPI 2002: XV BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2002, : 210 - 217
  • [25] A hybrid approach of NN and HMM for facial emotion classification
    Hu, TM
    De Silva, LC
    Sengupta, K
    PATTERN RECOGNITION LETTERS, 2002, 23 (11) : 1303 - 1310
  • [26] Hybrid modeling, HMM/NN architectures, and protein applications
    Baldi, P
    Chauvin, Y
    NEURAL COMPUTATION, 1996, 8 (07) : 1541 - 1565
  • [27] Improving Low-Resource Speech Recognition Based on Improved NN-HMM Structures
    Sun, Xiusong
    Yang, Qun
    Liu, Shaohan
    Yuan, Xin
    IEEE ACCESS, 2020, 8 : 73005 - 73014
  • [28] A hybrid NN-HMM system for connected digit recognition over telephone in Romanian language
    Gavat, I
    Zirra, M
    Enescu, V
    THIRD IEEE WORKSHOP ON INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA-96, PROCEEDINGS, 1996, : 37 - 40
  • [29] Mandarin emotional speech recognition based on SVM and NN
    Pao, Tsang-Long
    Chen, Yu-Te
    Yeh, Jun-Heng
    Li, Pei-Jia
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 1096 - +
  • [30] Student Modeling Using NN-HMM for EFL Course
    Homsi, Masun
    Lutfi, Rania
    Maria, Carto Rosa
    Barakat, Ghias
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 354 - +