BAYESIAN DISCRIMINATIVE ADAPTATION FOR SPEECH RECOGNITION

被引:1
|
作者
Raut, C. K. [1 ]
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
speech recognition; model adaptation; discriminative transforms; maximum-a-posteriori estimation; SPEAKER ADAPTATION;
D O I
10.1109/ICASSP.2009.4960595
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Linear transform-based speaker adaptation is a standard part of many speech recognition systems. For unsupervised adaptation maximum likelihood estimation is typically used, as discriminative transforms are more heavily biased towards the supervision hypothesis which may contain errors. In this work a Bayesian framework for discriminative adaptation is investigated. This reduces the hypothesis bias and allows robust estimates even with a limited amount of data. Various forms of discriminative maximum-a-posteriori estimation, and associated issues, are detailed. To address these problems, the use of discriminative mapping transforms is also described. The proposed framework is evaluated on an English conversational speech task.
引用
收藏
页码:4361 / 4364
页数:4
相关论文
共 50 条
  • [1] Discriminative speaker adaptation in Persian continuous speech recognition systems
    Pirhosseinloo, Shadi
    Ganj, Farshad Almas
    [J]. 4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 296 - 301
  • [2] Bayesian confidence scoring and adaptation techniques for speech recognition
    Kim, TY
    Ko, H
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2005, E88B (04) : 1756 - 1759
  • [3] Noise-robust speech recognition by discriminative adaptation in parallel model combination
    Chung, YJ
    [J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371
  • [4] Telephone speech recognition based on Bayesian adaptation of hidden Markov models
    Chien, JT
    Wang, HC
    [J]. SPEECH COMMUNICATION, 1997, 22 (04) : 369 - 384
  • [5] Discriminative-models for speech recognition
    Gales, M. J. F.
    [J]. 2007 INFORMATION THEORY AND APPLICATIONS WORKSHOP, 2007, : 168 - 174
  • [6] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
  • [7] Structured Discriminative Models for Speech Recognition
    Gales, Mark
    Watanabe, Shinji
    Fosler-Lussier, Eric
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 70 - 81
  • [8] Structured Discriminative Models for Speech Recognition
    Gales, Mark
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : XXII - XXII
  • [9] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
  • [10] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624