BAYESIAN DISCRIMINATIVE ADAPTATION FOR SPEECH RECOGNITION

被引：1

作者：

Raut, C. K. ^{[1
]}

Gales, M. J. F. ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

speech recognition; model adaptation; discriminative transforms; maximum-a-posteriori estimation; SPEAKER ADAPTATION;

D O I：

10.1109/ICASSP.2009.4960595

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Linear transform-based speaker adaptation is a standard part of many speech recognition systems. For unsupervised adaptation maximum likelihood estimation is typically used, as discriminative transforms are more heavily biased towards the supervision hypothesis which may contain errors. In this work a Bayesian framework for discriminative adaptation is investigated. This reduces the hypothesis bias and allows robust estimates even with a limited amount of data. Various forms of discriminative maximum-a-posteriori estimation, and associated issues, are detailed. To address these problems, the use of discriminative mapping transforms is also described. The proposed framework is evaluated on an English conversational speech task.

引用

页码：4361 / 4364

页数：4

共 50 条

[1] Discriminative speaker adaptation in Persian continuous speech recognition systems
Pirhosseinloo, Shadi
Ganj, Farshad Almas
[J]. 4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 296 - 301
[2] Bayesian confidence scoring and adaptation techniques for speech recognition
Kim, TY
Ko, H
[J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2005, E88B (04) : 1756 - 1759
[3] Noise-robust speech recognition by discriminative adaptation in parallel model combination
Chung, YJ
[J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371
[4] Telephone speech recognition based on Bayesian adaptation of hidden Markov models
Chien, JT
Wang, HC
[J]. SPEECH COMMUNICATION, 1997, 22 (04) : 369 - 384
[5] Discriminative-models for speech recognition
Gales, M. J. F.
[J]. 2007 INFORMATION THEORY AND APPLICATIONS WORKSHOP, 2007, : 168 - 174
[6] Discriminative Training for Automatic Speech Recognition
Heigold, Georg
Ney, Hermann
Schlueter, Ralf
Wiesler, Simon
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
[7] Structured Discriminative Models for Speech Recognition
Gales, Mark
Watanabe, Shinji
Fosler-Lussier, Eric
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 70 - 81
[8] Structured Discriminative Models for Speech Recognition
Gales, Mark
[J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : XXII - XXII
[9] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
[10] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
[J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624

← 1 2 3 4 5 →