Unsupervised Cross-Adaptation Using Language Model and Deep Learning Based Acoustic Model Adaptations

Cited by: 0
Authors
Takagi, Akira [1]
Konno, Kazuki [1]
Kato, Masaharu [1]
Kosaka, Tetsuo [1]
Affiliations
[1] Yamagata Univ, Grad Sch Sci & Engn, Yonezawa, Yamagata, Japan
Funding
Japan Society for the Promotion of Science
Keywords
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Deep learning is well known to improve speech recognition performance significantly. In deep learning-based systems, the deep neural network hidden Markov model (DNN-HMM) is used as the acoustic model (AM). Speaker adaptation techniques for DNN-HMM have also been investigated recently. The aim of this work is to improve the performance of unsupervised batch adaptation using DNN-HMM. The proposed method is based on the cross-adaptation approach, which exploits complementary information derived from several systems. In the cross-adaptation procedure, Gaussian mixture model HMM (GMM-HMM), DNN-HMM, and language model (LM) adaptation are conducted sequentially. The proposed method was evaluated on a Japanese lecture speech recognition task, where it reduced the error rate by 13.5% compared with the baseline DNN-HMM-based large vocabulary continuous speech recognition system.
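The sequential structure described in the abstract can be illustrated with a minimal control-flow sketch. This is not the authors' implementation: the abstract does not say which system decodes at each step or how each model is adapted, so the names `cross_adapt`, `toy_adapt`, and `toy_decode`, the dictionary-based "models", and the decode-after-every-stage loop are all hypothetical placeholders. The sketch only shows the general idea that each adaptation stage is supervised by hypotheses produced after the previous stage.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Transcripts = List[str]
Model = Dict[str, object]
Models = Dict[str, Model]
# Hypothetical signatures: an adapter updates one model from unlabeled audio
# plus automatic transcripts; a decoder produces transcripts from all models.
Adapter = Callable[[Model, Sequence[str], Transcripts], Model]
Decoder = Callable[[Models, Sequence[str]], Transcripts]


def cross_adapt(
    models: Models,
    stages: Sequence[Tuple[str, Adapter]],
    decode: Decoder,
    utterances: Sequence[str],
) -> Tuple[Models, List[Transcripts]]:
    """Adapt one component per stage, re-decoding after every stage."""
    hyps = decode(models, utterances)  # first pass with unadapted models
    history = [hyps]
    for name, adapt in stages:
        # Unsupervised: the latest hypotheses stand in for reference labels.
        models = {**models, name: adapt(models[name], utterances, hyps)}
        hyps = decode(models, utterances)
        history.append(hyps)
    return models, history


# Toy stand-ins (assumptions) so the control flow can be run end to end.
def toy_adapt(model: Model, utterances: Sequence[str], hyps: Transcripts) -> Model:
    return {"passes": model["passes"] + 1, "supervision": list(hyps)}


def toy_decode(models: Models, utterances: Sequence[str]) -> Transcripts:
    tag = "/".join(f"{k}{v['passes']}" for k, v in sorted(models.items()))
    return [f"{u}:{tag}" for u in utterances]


initial = {name: {"passes": 0} for name in ("gmm", "dnn", "lm")}
order = [("gmm", toy_adapt), ("dnn", toy_adapt), ("lm", toy_adapt)]
adapted, history = cross_adapt(initial, order, toy_decode, ["utt1"])
```

In this toy run, the DNN stage receives hypotheses decoded after the GMM stage, and the LM stage receives hypotheses decoded after the DNN stage, which is the chaining that the term cross-adaptation refers to. Real adapters and decoders would replace the toy functions.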
Pages: 4