A comparative study of model-based adaptation techniques for a compact speech recognizer

被引：0

作者：

Thiele, F ^{[1
]}

Bippus, R ^{[1
]}

机构：

[1] Philips Res Labs, D-52066 Aachen, Germany

来源：

ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, Eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP and MLLR adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.

引用

页码：29 / 32

页数：4

共 50 条

[1] Online Model Adaptation for Voice Conversion using Model-based Speech Synthesis Techniques
Wu, Dalei
Li, Baojie
Jiang, Hui
Fu, Qian-Jie
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1611 - +
[2] Speaker adaptation techniques for speech recognition with a speaker-independent phonetic recognizer
Kim, WG
Jang, M
[J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 95 - 100
[3] A comparative study of two model-based control techniques for the industrial manipulator
Dumlu, Ahmet
Erenturk, Koksal
Kaleli, Aliriza
Ayten, Kagan Koray
[J]. ROBOTICA, 2017, 35 (10) : 2036 - 2055
[4] A comparative study of model-based control techniques for batch crystallization process
Shen, JX
Chiu, MS
Wang, QG
[J]. JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 1999, 32 (04) : 456 - 464
[5] A comparative Study of Model-Based and Data-Based Model Order Reduction Techniques for Nonlinear Systems
Aizad, T.
Maganga, O.
Sumislawska, M.
Burnham, K. J.
[J]. PROGRESS IN SYSTEMS ENGINEERING, 2015, 366 : 83 - 88
[6] Adaptation of an EMG-Based Speech Recognizer via Meta-Learning
Prorokovic, Krsto
Wand, Michael
Schultz, Tanja
Schmidhuber, Juergen
[J]. 2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
[7] A model distance maximizing framework for speech recognizer-based speech enhancement
BabaAli, Bagher
Sameti, Hossein
Falk, Tiago H.
[J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2011, 65 (02) : 99 - 106
[8] A Bayesian view on acoustic model-based techniques for robust speech recognition
Maas, Roland
Huemmer, Christian
Sehr, Armin
Kellermann, Walter
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 16
[9] A Bayesian view on acoustic model-based techniques for robust speech recognition
Roland Maas
Christian Huemmer
Armin Sehr
Walter Kellermann
[J]. EURASIP Journal on Advances in Signal Processing, 2015
[10] A Case Study in Model-Based Adaptation of Web Services
Camara, Javier
Antonio Martin, Jose
Salauen, Gwen
Canal, Carlos
Pimentel, Ernesto
[J]. LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION, AND VALIDATION, PT II, 2010, 6416 : 112 - +

← 1 2 3 4 5 →