Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

Cited by: 87
Authors
Yu, Dong [1]
Varadarajan, Balakrishnan [2]
Deng, Li [1]
Acero, Alex [1]
Affiliations
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Johns Hopkins Univ, Baltimore, MD 21218 USA
Source
COMPUTER SPEECH AND LANGUAGE | 2010 / Vol. 24 / Issue 03
Keywords
Active learning; Semi-supervised learning; Acoustic model; Entropy reduction; Confidence; Lattice; Collective information;
DOI
10.1016/j.csl.2009.03.004
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a unified global entropy reduction maximization (GERM) framework for active learning and semi-supervised learning for speech recognition. Active learning aims to select a limited subset of utterances for transcription from a large pool of un-transcribed utterances, while semi-supervised learning addresses the problem of selecting the right transcriptions for un-transcribed utterances, so that the accuracy of the automatic speech recognition system can be maximized. We show that both the traditional confidence-based active learning and semi-supervised learning approaches can be improved by maximizing the lattice entropy reduction over the whole dataset. We introduce our criterion and framework, show how the criterion can be simplified and approximated, and describe how these approaches can be combined. We demonstrate the effectiveness of our new framework and algorithm with directory assistance data collected under real usage scenarios and show that our GERM-based active learning and semi-supervised learning algorithms consistently outperform their confidence-based counterparts by a significant margin. To achieve the same recognition accuracy, our new active learning algorithm cuts the number of utterances that need to be transcribed by 50% compared to the confidence-based active learning approach, and by 60% compared to the random sampling approach. Our new semi-supervised algorithm determines, in a principled way, the cutoff point for deciding which utterance-transcription pairs to use; we demonstrate that the point it finds is very close to the achievable peak point. © 2009 Elsevier Ltd. All rights reserved.
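The abstract contrasts confidence-based selection with an entropy-based criterion. The sketch below only illustrates how those two uncertainty measures can rank utterances differently; it is not the GERM algorithm, whose dataset-wide formulation is not given in this record. It assumes a short N-best list of hypothesis posteriors stands in for a recognition lattice, and the names `hypothesis_entropy`, `select_by_confidence`, `select_by_entropy`, and the toy data are hypothetical.

```python
import math


def hypothesis_entropy(posteriors):
    """Shannon entropy (in nats) of a distribution over competing hypotheses.

    Assumption: `posteriors` is an N-best list of non-negative scores with a
    positive sum; it is renormalized here before the entropy is computed.
    """
    total = sum(posteriors)
    probs = [p / total for p in posteriors if p > 0]
    return -sum(p * math.log(p) for p in probs)


def select_by_confidence(nbest, budget):
    """Baseline: pick the utterances whose top hypothesis has the lowest posterior."""
    return sorted(nbest, key=lambda u: max(nbest[u]))[:budget]


def select_by_entropy(nbest, budget):
    """Per-utterance variant: pick the utterances with the highest hypothesis entropy.

    This scores each utterance in isolation; it does not model the effect of a
    transcription on the rest of the dataset.
    """
    return sorted(nbest, key=lambda u: hypothesis_entropy(nbest[u]), reverse=True)[:budget]


# Toy data (hypothetical): utterance id -> posteriors of its competing hypotheses.
nbest = {
    "a": [0.90, 0.05, 0.05],
    "b": [0.50, 0.50],
    "c": [0.55, 0.15, 0.15, 0.15],
}

print(select_by_confidence(nbest, 1))
print(select_by_entropy(nbest, 1))
```

In this toy data the two rankings disagree: utterance "b" has the lowest top-hypothesis posterior, while utterance "c" spreads its probability mass over more alternatives and so has the higher entropy.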
Pages: 433 - 444
Number of pages: 12