Unsupervised Cross-Adaptation Using Language Model and Deep Learning Based Acoustic Model Adaptations

被引:0
|
作者
Takagi, Akira [1 ]
Konno, Kazuki [1 ]
Kato, Masaharu [1 ]
Kosaka, Tetsuo [1 ]
机构
[1] Yamagata Univ, Grad Sch Sci & Engn, Yonezawa, Yamagata, Japan
基金
日本学术振兴会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
It is well known that deep learning-based speech recognition improves performance significantly. In deep learning based systems, the deep neural network hidden Markov model (DNN-HMM) is used as an acoustic model (AM). Recently, speaker adaptation techniques based on DNN-HMM have also been investigated. The aim of this work is to improve the performance of unsupervised batch adaptation using DNN-HMM. The proposed adaptation method is based on the cross-adaptation approach, where complementary information derived from several systems is used. Gaussian mixture model HMM (GMM-HMM), DNN-HMM, and language model (LM) adaptation processes are conducted sequentially in the cross-adaptation procedure. The proposed adaptation method was evaluated on a Japanese lecture speech recognition task, reducing the error rate by 13.5% compared to the baseline DNN-HMM-based large vocabulary continuous speech recognition system.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] A Compressed Unsupervised Deep Domain Adaptation Model for Efficient Cross-Domain Fault Diagnosis
    Xu, Gaowei
    Huang, Chenxi
    Silva, Daniel Santos da
    Albuquerque, Victor Hugo C. de
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6741 - 6749
  • [42] Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription
    Mrva, David
    Woodland, Philip C.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2210 - 2213
  • [43] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [44] Unsupervised language model adaptation for handwritten Chinese text recognition
    Wang, Qiu-Feng
    Yin, Fei
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2014, 47 (03) : 1202 - 1216
  • [45] Improving Unsupervised Language Model Adaptation with Discriminative Data Filtering
    Chang, Shuangyu
    Levit, Michael
    Parthasarathy, Partha
    Dumoulin, Benoit
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1207 - 1211
  • [46] Novel Weighting Scheme for Unsupervised Language Model Adaptation Using Latent Dirichlet Allocation
    Haidar, Md Akmal
    O'Shaughnessy, Douglas
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2438 - 2441
  • [47] UNSUPERVISED CV LANGUAGE MODEL ADAPTATION BASED ON DIRECT LIKELIHOOD MAXIMIZATION SENTENCE SELECTION
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5029 - 5032
  • [48] Unsupervised language model adaptation via topic modeling based on named entity hypotheses
    Liu, Yang
    Liu, Feifan
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4921 - 4924
  • [49] An Acoustic Model For English Speech Recognition Based On Deep Learning
    Ling, Zhang
    [J]. 2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 610 - 614
  • [50] Towards Accurate Model Selection in Deep Unsupervised Domain Adaptation
    You, Kaichao
    Wang, Ximei
    Long, Mingsheng
    Jordan, Michael I.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97