BAYESIAN DISCRIMINATIVE ADAPTATION FOR SPEECH RECOGNITION

被引:1
|
作者
Raut, C. K. [1 ]
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
speech recognition; model adaptation; discriminative transforms; maximum-a-posteriori estimation; SPEAKER ADAPTATION;
D O I
10.1109/ICASSP.2009.4960595
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Linear transform-based speaker adaptation is a standard part of many speech recognition systems. For unsupervised adaptation maximum likelihood estimation is typically used, as discriminative transforms are more heavily biased towards the supervision hypothesis which may contain errors. In this work a Bayesian framework for discriminative adaptation is investigated. This reduces the hypothesis bias and allows robust estimates even with a limited amount of data. Various forms of discriminative maximum-a-posteriori estimation, and associated issues, are detailed. To address these problems, the use of discriminative mapping transforms is also described. The proposed framework is evaluated on an English conversational speech task.
引用
收藏
页码:4361 / 4364
页数:4
相关论文
共 50 条
  • [41] Discriminative transform for confidence estimation in Mandarin speech recognition
    Guo, G
    Wang, RH
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 269 - 272
  • [42] A New Method for Discriminative Model Combination in Speech Recognition
    Wu Yahui
    Liu Gang
    Guo Jun
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 200 - 203
  • [43] Discriminative training of stochastic Markov graphs for speech recognition
    Wolfertstetter, F
    Ruske, G
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 581 - 584
  • [44] A discriminative and robust training algorithm for noisy speech recognition
    Hong, WT
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 8 - 11
  • [45] Improvements to Generalized Discriminative Feature Transformation for Speech Recognition
    Hsiao, Roger
    Metze, Florian
    Schultz, Tanja
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1361 - 1364
  • [46] Scaling Laws for Discriminative Speech Recognition Rescoring Models
    Gu, Yile
    Shivakumar, Prashanth Gurunath
    Kolehmainen, Jari
    Gandhe, Ankur
    Rastrow, Ariya
    Bulyko, Ivan
    [J]. INTERSPEECH 2023, 2023, : 471 - 475
  • [47] Multi resolution discriminative models for subvocalic speech recognition
    Raugas, Mark
    Sridhar, Vivek Kumar Rangarajan
    Prasad, Rohit
    Natarajan, Prem
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2634 - 2637
  • [48] FAST SPEAKER ADAPTATION OF HYBRID NN/HMM MODEL FOR SPEECH RECOGNITION BASED ON DISCRIMINATIVE LEARNING OF SPEAKER CODE
    Abdel-Hamid, Ossama
    Jiang, Hui
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7942 - 7946
  • [49] NORMALIZATION AND ADAPTATION OF SPEECH DATA FOR AUTOMATIC SPEECH RECOGNITION
    SCARR, RWA
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (01): : 41 - 59
  • [50] Speech selection and environmental adaptation for asynchronous speech recognition
    Ren, Bo
    Wang, Longbiao
    Kai, Atsuhiko
    Zhang, Zhaofeng
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 119 - 124