A comparative study of model-based adaptation techniques for a compact speech recognizer

被引:0
|
作者
Thiele, F [1 ]
Bippus, R [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, Eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP and MLLR adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.
引用
收藏
页码:29 / 32
页数:4
相关论文
共 50 条
  • [31] A study on model-based error rate estimation for automatic speech recognition
    Huang, CS
    Wang, HC
    Lee, CH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06): : 581 - 589
  • [32] Model-based Policy Optimization with Unsupervised Model Adaptation
    Shen, Jian
    Zhao, Han
    Zhang, Weinan
    Yu, Yong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [33] Comparative study of model-based PI(D) autotuning methods
    Leva, Alberto
    [J]. 2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 2788 - 2793
  • [34] A comparative study of speech rate estimation techniques
    Dekens, Tomas
    Demol, Mike
    Verhelst, Werner
    Verhoeve, Piet
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 225 - +
  • [35] Comparative study of automatic speech recognition techniques
    Cutajar, Michelle
    Gatt, Edward
    Grech, Ivan
    Casha, Owen
    Micallef, Joseph
    [J]. IET SIGNAL PROCESSING, 2013, 7 (01) : 25 - 46
  • [36] Comparative study of automatic speech recognition techniques
    [J]. 1600, Institution of Engineering and Technology, United States (07):
  • [37] A Comparative Study of Audio/Speech Steganalysis Techniques
    Paulin, Catherine
    Selouani, Sid-Ahmed
    Hervet, Eric
    [J]. 2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
  • [38] Model-based controllers for CubeSat ORU installation: A comparative study
    Kurnell, Mitchell
    Sharf, Inna
    [J]. ACTA ASTRONAUTICA, 2024, 223 : 666 - 684
  • [39] NOISE IDENTIFICATION FOR MODEL-BASED SPEECH ENHANCEMENT
    Jiang Wenbin
    Ying Rendong
    Liu Peilin
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 478 - 483
  • [40] Model-Based Speech Enhancement in the Modulation Domain
    Wang, Yu
    Brookes, Mike
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (03) : 580 - 594