Combined optimisation of baseforms and subword models for an HMM based speech recogniser

被引：0

作者：

Holter, T

Svendsen, T

机构：

来源：

ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper a framework for combined optimisation of baseforms and subword models for a speech recogniser is proposed. Given a set of subword Hidden Markov Models (HMMs) and a set of utterances of a specific word, the modified tree-trellis algorithm and the Baum-Welch re-estimation procedure is used iteratively to achieve a combined optimisation of baseforms and subword models. The DARPA Resource Management (RM) database was used to evaluate the combined optimisation scheme. The proposed method resulted in a monotonic increase in the likelihood score of both test- and training data. When compared to the initial lexicon derived from the DARPA RM-distribution and a set of initial HMMs, a 13% reduction in word error rate is achieved at best.

引用

页码：321 / 324

页数：4

共 50 条

[1] Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units
Holter, T
Svendsen, T
[J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 199 - 206
[2] Advances in subword-based HMM-DNN speech recognition across languages
Smit, Peter
Virpioja, Sami
Kurimo, Mikko
[J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
[3] Transforming Features to Compensate Speech Recogniser Models for Noise
van Dalen, R. C.
Flego, F.
Gales, M. J. F.
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2459 - 2462
[4] Designing Syllable Models for an HMM Based Speech Recognition System
Proenca, Kseniya
Demuynck, Kris
Van Compernolle, Dirk
[J]. Speech and Computer, 2016, 9811 : 216 - 223
[5] FACTOR ANALYZED VOICE MODELS FOR HMM-BASED SPEECH SYNTHESIS
Kazumi, Kyosuke
Nankaku, Yoshihiko
Tokuda, Keiichi
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4234 - 4237
[6] Synthesis of stressed speech from isolated neutral speech using HMM-based models
BouGhazale, SE
Hansen, JHL
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1860 - 1863
[7] Speech recognition with HMM models for cochlear prostheses
Sakka, Z
Kachouri, A
Samet, M
[J]. 2004 IEEE International Conference on Industrial Technology (ICIT), Vols. 1- 3, 2004, : 1478 - 1481
[8] Subword unit based speech recognition in car environments
Fischer, A
Stahl, V
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 257 - 260
[9] AN HMM-BASED FORMALISM FOR AUTOMATIC SUBWORD UNIT DERIVATION AND PRONUNCIATION GENERATION
Razavi, Marzieh
Magimai-Doss, Mathew
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4639 - 4643
[10] On HMM speech recognition based on complex speech analysis
Kinjo, Tatsuhiko
Funaki, Keiichi
[J]. IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 2605 - +

← 1 2 3 4 5 →