Acoustic model training using committee-based active and semi-supervised learning for speech recognition

被引:0
|
作者
Tsutaoka, Takuya [1 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo, Japan
关键词
active learning; semi-supervised learning; LVCSR; query by committee; GLOBAL ENTROPY REDUCTION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose an acoustic model training method which combines committee-based active learning and semi-supervised learning for large vocabulary continuous speech recognition. In this method, each untranscribed training utterance is examined by a committee of multiple speech recognizers, and the degree of disagreement in the committee on its transcription is used for selecting utterances. Those utterances the committee members disagree with each other are transcribed for active learning, while those they agree are used for semi-supervised learning. Our method was evaluated using the Corpus of Spontaneous Japanese. It was shown that it achieved higher recognition accuracy with lower transcription costs than random sampling, active learning alone, and semi-supervised learning alone. We also propose a new data selection method called middle selection in semi-supervised learning.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    [J]. Journal of Computer Science & Technology, 2010, 25 (04) : 681 - 698
  • [2] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Hady, Mohamed Farouk Abdel
    Schwenker, Friedhelm
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (04): : 681 - 698
  • [3] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    [J]. Journal of Computer Science and Technology, 2010, 25 : 681 - 698
  • [4] Combining Committee-Based Semi-supervised and Active Learning and Its Application to Handwritten Digits Recognition
    Hady, Mohamed Farouk Abdel
    Schwenker, Friedhelm
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 225 - 234
  • [5] Committee-Based Active Learning for Speech Recognition
    Hamanaka, Yuzo
    Shinoda, Koichi
    Tsutaoka, Takuya
    Furui, Sadaoki
    Emori, Tadashi
    Koshinaka, Takafumi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2015 - 2023
  • [6] Semi-Supervised Speech Recognition Acoustic Model Training Using Policy Gradient
    Chung, Hoon
    Lee, Sung Joo
    Jeon, Hyeong Bae
    Park, Jeon Gue
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (10):
  • [7] Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition
    Smidl, Lubos
    Svec, Jan
    Prazak, Ales
    Trmal, Jan
    [J]. SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 646 - 655
  • [8] Semi-Supervised Training in Deep Learning Acoustic Model
    Huang, Yan
    Wang, Yongqiang
    Gong, Yifan
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3848 - 3852
  • [9] Lithuanian Broadcast Speech Transcription using Semi-supervised Acoustic Model Training
    Lileikyte, Rasa
    Gorin, Arseniy
    Lamel, Lori
    Gauvain, Jean-Luc
    Fraga-Silva, Thiago
    [J]. SLTU-2016 5TH WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGIES FOR UNDER-RESOURCED LANGUAGES, 2016, 81 : 107 - 113
  • [10] SPEECH MODELING BASED ON COMMITTEE-BASED ACTIVE LEARNING
    Hamanaka, Yuzo
    Shinoda, Koichi
    Furui, Sadaoki
    Emori, Tadashi
    Koshinaka, Takafumi
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4350 - 4353