Comparing NN paradigms in hybrid NN/HMM speech recognition using tied posteriors

被引：0

作者：

Stadermann, J ^{[1
]}

Rigoll, G ^{[1
]}

机构：

[1] Tech Univ Munich, Inst Human Machine Commun, D-80290 Munich, Germany

来源：

ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 | 2003年

关键词：

D O I：

10.1109/ASRU.2003.1318409

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hybrid NN/HAM acoustic modeling is nowadays an established alternative approach in automatic speech recognition technology. A comparison of feed-forward and recurrent neural network topologies integrated in the tied-posteriors framework is presented. We give some insights in the training process of the networks estimating class posterior probabilities and show how the net's quality can be determined by introducing a new measurement prior to evaluating the complete ASR system. Finally we demonstrate the flexibility of the tied-posteriors framework by showing results for different context independent and context dependent acoustic models all based on the same system structure.

引用

页码：89 / 93

页数：5

共 50 条

[41] RECOGNITION OF ARABIC LICENSE PLATES USING NN
Zidouri, Abdelmalek
Deriche, Mohammed
2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 228 - 231
[42] Hybrid modeling of PHMM and HMM for speech recognition
Ogawa, T
Kobayashi, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 140 - 143
[43] Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers
Al Dujaili, Mohammed Jawad
Ebrahimi-Moghadam, Abbas
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42783 - 42801
[44] Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers
Mohammed Jawad Al Dujaili
Abbas Ebrahimi-Moghadam
Multimedia Tools and Applications, 2023, 82 : 42783 - 42801
[45] Handwritten Farsi Word Recognition Using NN-Based Fusion of HMM Classifiers with Different Types of Features
Arani, Seyed Ali Asghar Abbaszadeh
Kabir, Ehsanollah
Ebrahimpour, Reza
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2019, 19 (01)
[46] Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
Nahar, Khalid M. O.
Abu Shquier, Mohammed
Al-Khatib, Wasfi G.
Al-Muhtaseb, Husni
Elshafei, Moustafa
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 495 - 508
[47] Comparison of Discriminative Input and Output Transformations for Speaker Adaptation in the Hybrid NN/HMM Systems
Li, Bo
Sim, Khe Chai
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 526 - 529
[48] MFCC based Recognition of Repetitions and Prolongations in Stuttered Speech using k-NN and LDA
Chee, Lim Sin
Ai, Ooi Chia
Hariharan, M.
Yaacob, Sazali
2009 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT: SCORED 2009, PROCEEDINGS, 2009, : 146 - 149
[49] Applying Long Short-Term Memory concept to hybrid "CD-NN-HMM" model for keywords spotting in continuous speech
Dridi, Hinda
Ouni, Kais
2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
[50] Speech emotion recognition based on a hybrid of HMM/ANN
Mao, Xia
Zhang, Bing
Luo, Yi
PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATIONS, 2007, : 369 - 372

← 1 2 3 4 5 →