Comparing NN paradigms in hybrid NN/HMM speech recognition using tied posteriors

Cited by: 0
Authors
Stadermann, J [1]
Rigoll, G [1]
Affiliation
[1] Tech Univ Munich, Inst Human Machine Commun, D-80290 Munich, Germany
Source
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 | 2003
DOI
10.1109/ASRU.2003.1318409
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hybrid NN/HMM acoustic modeling is now an established alternative approach in automatic speech recognition. A comparison of feed-forward and recurrent neural network topologies integrated in the tied-posteriors framework is presented. We give some insights into the training process of the networks estimating class posterior probabilities, and we introduce a new measure that allows the net's quality to be determined before the complete ASR system is evaluated. Finally, we demonstrate the flexibility of the tied-posteriors framework by showing results for different context-independent and context-dependent acoustic models, all based on the same system structure.
Pages: 89-93
Number of pages: 5