On maximum mutual information speaker-adapted training

被引:0
|
作者
McDonough, J [1 ]
Schaaf, T [1 ]
Waibel, A [1 ]
机构
[1] Univ Karlsruhe, Inst Log Komplexitat & Deduktionsyst, Interact Syst Labs, D-76128 Karlsruhe, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we combine maximum mutual information-based parameter estimation with speaker-adapted training (SAT). As will be shown, this can be achieved by performing unsupervised parameter estimation on the test data, a distinct advantage for many recognition tasks involving conversational speech. We also propose an approximation to the maximum likelihood and maximum mutual information SAT re-estimation formulae that greatly reduces the amount of disk space required to conduct training on corpora such as Broadcast News, which contains speech from thousands of speakers. We present the results of a set of speech recognition experiments on three test sets: the English Spontaneous Scheduling Task corpus, Broadcast News, and a new corpus of Meeting Room data collected at the Interactive Systems Laboratories of the Carnegie Mellon University.
引用
收藏
页码:601 / 604
页数:4
相关论文
共 50 条
  • [21] Maximum mutual information regularized classification
    Wang, Jim Jing-Yan
    Wang, Yi
    Zhao, Shiguang
    Gao, Xin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 1 - 8
  • [22] Maximum mutual information training for an online neural predictive handwritten word recognition system
    Garcia-Salicetti S.
    Dorizzi B.
    Gallinari P.
    Wimmer Z.
    [J]. International Journal on Document Analysis and Recognition, 2001, 4 (01) : 56 - 68
  • [23] Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems
    Madikeri, Srikanth
    Khonglah, Banriskhem K.
    Tong, Sibo
    Motlicek, Petr
    Bourlard, Herve
    Povey, Daniel
    [J]. INTERSPEECH 2020, 2020, : 4746 - 4750
  • [24] Dimension reduction for speaker identification based on mutual information
    Lu, Xugang
    Dang, Jianwu
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1157 - 1160
  • [25] Maximum Gaussianality training for deep speaker vector normalization
    Cai, Yunqi
    Li, Lantian
    Abel, Andrew
    Zhu, Xiaoyan
    Wang, Dong
    [J]. Pattern Recognition, 2024, 145
  • [26] Maximum Gaussianality training for deep speaker vector normalization
    Cai, Yunqi
    Li, Lantian
    Abel, Andrew
    Zhu, Xiaoyan
    Wang, Dong
    [J]. PATTERN RECOGNITION, 2024, 145
  • [27] Semi-supervised Maximum Mutual Information Training of Deep Neural Network Acoustic Models
    Manohar, Vimal
    Povey, Daniel
    Khudanpur, Sanjeev
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2630 - 2634
  • [28] Image Thresholding Based on Maximum Mutual Information
    Fang, Lulu
    Zou, Yaobin
    Dong, Fangmin
    Sun, Shuifa
    Lei, Bangjun
    [J]. 2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014, : 403 - 409
  • [29] Part Recognition Based on Maximum Mutual Information
    Ge Sen
    Huang Dagui
    [J]. PROGRESS OF MACHINING TECHNOLOGY, 2009, 407-408 : 234 - 238
  • [30] Employing maximum mutual information for Bayesian classification
    van Gerven, M
    Lucas, P
    [J]. BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2004, 3337 : 188 - 199