A comparative study of model-based adaptation techniques for a compact speech recognizer

被引:0
|
作者
Thiele, F [1 ]
Bippus, R [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, Eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP and MLLR adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.
引用
收藏
页码:29 / 32
页数:4
相关论文
共 50 条
  • [1] Online Model Adaptation for Voice Conversion using Model-based Speech Synthesis Techniques
    Wu, Dalei
    Li, Baojie
    Jiang, Hui
    Fu, Qian-Jie
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1611 - +
  • [2] Speaker adaptation techniques for speech recognition with a speaker-independent phonetic recognizer
    Kim, WG
    Jang, M
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 95 - 100
  • [3] A comparative study of two model-based control techniques for the industrial manipulator
    Dumlu, Ahmet
    Erenturk, Koksal
    Kaleli, Aliriza
    Ayten, Kagan Koray
    [J]. ROBOTICA, 2017, 35 (10) : 2036 - 2055
  • [4] A comparative study of model-based control techniques for batch crystallization process
    Shen, JX
    Chiu, MS
    Wang, QG
    [J]. JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 1999, 32 (04) : 456 - 464
  • [5] A comparative Study of Model-Based and Data-Based Model Order Reduction Techniques for Nonlinear Systems
    Aizad, T.
    Maganga, O.
    Sumislawska, M.
    Burnham, K. J.
    [J]. PROGRESS IN SYSTEMS ENGINEERING, 2015, 366 : 83 - 88
  • [6] Adaptation of an EMG-Based Speech Recognizer via Meta-Learning
    Prorokovic, Krsto
    Wand, Michael
    Schultz, Tanja
    Schmidhuber, Juergen
    [J]. 2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [7] A model distance maximizing framework for speech recognizer-based speech enhancement
    BabaAli, Bagher
    Sameti, Hossein
    Falk, Tiago H.
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2011, 65 (02) : 99 - 106
  • [8] A Bayesian view on acoustic model-based techniques for robust speech recognition
    Maas, Roland
    Huemmer, Christian
    Sehr, Armin
    Kellermann, Walter
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 16
  • [9] A Bayesian view on acoustic model-based techniques for robust speech recognition
    Roland Maas
    Christian Huemmer
    Armin Sehr
    Walter Kellermann
    [J]. EURASIP Journal on Advances in Signal Processing, 2015
  • [10] A Case Study in Model-Based Adaptation of Web Services
    Camara, Javier
    Antonio Martin, Jose
    Salauen, Gwen
    Canal, Carlos
    Pimentel, Ernesto
    [J]. LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION, AND VALIDATION, PT II, 2010, 6416 : 112 - +