On maximum mutual information speaker-adapted training

Cited by: 0

Authors:

McDonough, J [1]

Schaaf, T [1]

Waibel, A [1]

Affiliations:

[1] Univ Karlsruhe, Inst Log Komplexitat & Deduktionsyst, Interact Syst Labs, D-76128 Karlsruhe, Germany

Source:

2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS | 2002

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this work, we combine maximum mutual information-based parameter estimation with speaker-adapted training (SAT). As will be shown, this can be achieved by performing unsupervised parameter estimation on the test data, a distinct advantage for many recognition tasks involving conversational speech. We also propose an approximation to the maximum likelihood and maximum mutual information SAT re-estimation formulae that greatly reduces the amount of disk space required to conduct training on corpora such as Broadcast News, which contains speech from thousands of speakers. We present the results of a set of speech recognition experiments on three test sets: the English Spontaneous Scheduling Task corpus, Broadcast News, and a new corpus of Meeting Room data collected at the Interactive Systems Laboratories at Carnegie Mellon University.


Pages: 601-604

Page count: 4

50 items in total

[21] Maximum mutual information regularized classification
Wang, Jim Jing-Yan
Wang, Yi
Zhao, Shiguang
Gao, Xin
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 1 - 8
[22] Maximum mutual information training for an online neural predictive handwritten word recognition system
Garcia-Salicetti S.
Dorizzi B.
Gallinari P.
Wimmer Z.
[J]. International Journal on Document Analysis and Recognition, 2001, 4 (01) : 56 - 68
[23] Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems
Madikeri, Srikanth
Khonglah, Banriskhem K.
Tong, Sibo
Motlicek, Petr
Bourlard, Herve
Povey, Daniel
[J]. INTERSPEECH 2020, 2020, : 4746 - 4750
[24] Dimension reduction for speaker identification based on mutual information
Lu, Xugang
Dang, Jianwu
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1157 - 1160
[25] Maximum Gaussianality training for deep speaker vector normalization
Cai, Yunqi
Li, Lantian
Abel, Andrew
Zhu, Xiaoyan
Wang, Dong
[J]. Pattern Recognition, 2024, 145
[27] Semi-supervised Maximum Mutual Information Training of Deep Neural Network Acoustic Models
Manohar, Vimal
Povey, Daniel
Khudanpur, Sanjeev
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2630 - 2634
[28] Image Thresholding Based on Maximum Mutual Information
Fang, Lulu
Zou, Yaobin
Dong, Fangmin
Sun, Shuifa
Lei, Bangjun
[J]. 2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014, : 403 - 409
[29] Part Recognition Based on Maximum Mutual Information
Ge Sen
Huang Dagui
[J]. PROGRESS OF MACHINING TECHNOLOGY, 2009, 407-408 : 234 - 238
[30] Employing maximum mutual information for Bayesian classification
van Gerven, M
Lucas, P
[J]. BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2004, 3337 : 188 - 199
