A comparative study of model-based adaptation techniques for a compact speech recognizer

被引:0
|
作者
Thiele, F [1 ]
Bippus, R [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, Eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP and MLLR adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.
引用
收藏
页码:29 / 32
页数:4
相关论文
共 50 条
  • [21] INDIRECT MODEL-BASED SPEECH ENHANCEMENT
    Le Roux, Jonathan
    Hershey, John R.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4045 - 4048
  • [22] Adaptive model-based speech enhancement
    Logan, B
    Robinson, T
    [J]. SPEECH COMMUNICATION, 2001, 34 (04) : 351 - 368
  • [23] Inversion-based nonlinear adaptation of noisy acoustic parameters for a neural/HMM speech recognizer
    Trentin, Edmondo
    Gori, Marco
    [J]. NEUROCOMPUTING, 2006, 70 (1-3) : 398 - 408
  • [24] Model-Based Adaptation for Robotics Software
    Aldrich, Jonathan
    Garlan, David
    Kaestner, Christian
    Le Goues, Claire
    Mohseni-Kabir, Anahita
    Ruchkin, Ivan
    Samuel, Selva
    Schmerl, Bradley
    Timperley, Christopher
    Veloso, Manuela
    Voysey, Ian
    Biswas, Joydeep
    Guha, Arjun
    Holtz, Jarrett
    Camara, Javier
    Jamshidi, Pooyan
    [J]. IEEE SOFTWARE, 2019, 36 (02) : 83 - 90
  • [25] Model-based user interface adaptation
    Nilsson, Erik G.
    Floch, Jacqueline
    Hallsteinsen, Svein
    Stav, Erlend
    [J]. COMPUTERS & GRAPHICS-UK, 2006, 30 (05): : 692 - 701
  • [26] Experiments on Front-End Techniques and Segmentation Model for Robust Indian Language Speech Recognizer
    Sriranjani, R.
    Karthick, Murali B.
    Umesh, S.
    [J]. 2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [27] Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition
    Deena, Saud
    Hasan, Madina
    Doulaty, Mortaza
    Saz, Oscar
    Hain, Thomas
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2343 - 2347
  • [28] Towards non-stationary model-based noise adaptation for large vocabulary speech recognition
    Kristjansson, T
    Frey, B
    Deng, L
    Acero, A
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 337 - 340
  • [29] A Metric for Evaluating Speech Recognizer Output based on Human-Perception Model
    Itoh, Nobuyasu
    Kurata, Gakuto
    Tachibana, Ryuki
    Nishimura, Masafumi
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1285 - 1288
  • [30] Protein evolution constraints and model-based techniques to study them
    Thorne, Jeffrey L.
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2007, 17 (03) : 337 - 341