Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

被引:87
|
作者
Yu, Dong [1 ]
Varadarajan, Balakrishnan [2 ]
Deng, Li [1 ]
Acero, Alex [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Johns Hopkins Univ, Baltimore, MD 21218 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2010年 / 24卷 / 03期
关键词
Active learning; Semi-supervised learning; Acoustic model; Entropy reduction; Confidence; Lattice; Collective information;
D O I
10.1016/j.csl.2009.03.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a unified global entropy reduction maximization (GERM) framework for active learning and semi-supervised learning for speech recognition. Active learning aims to select a limited subset of utterances for transcribing from a large amount of un-transcribed utterances, while semi-supervised learning addresses the problem of selecting right transcriptions for un-transcribed utterances, so that the accuracy of the automatic speech recognition system can be maximized. We show that both the traditional confidence-based active learning and semi-supervised learning approaches can be improved by maximizing the lattice entropy reduction over the whole dataset. We introduce our criterion and framework, show how the criterion can be simplified and approximated, and describe how these approaches can be combined. We demonstrate the effectiveness of our new framework and algorithm with directory assistance data collected under the real usage scenarios and show that our GERM based active learning and semi-supervised learning algorithms consistently outperform the confidence-based counterparts by a significant margin. Using our new active learning algorithm cuts the number of utterances needed for transcribing by 50% to achieve the same recognition accuracy obtained using the confidence-based active learning approach, and by 60% compared to the random sampling approach. Using our new semi-supervised algorithm we can determine the cutoff point in determining which utterance-transcription pair to use in a principled way by demonstrating that the point it finds is very close to the achievable peak point. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:433 / 444
页数:12
相关论文
共 50 条
  • [41] Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
    Feng, Tiantian
    Narayanan, Shrikanth
    INTERSPEECH 2022, 2022, : 5050 - 5054
  • [42] An active semi-supervised deep learning model for human activity recognition
    Bi, Haixia
    Perello-Nieto, Miquel
    Santos-Rodriguez, Raul
    Flach, Peter
    Craddock, Ian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (10) : 13049 - 13065
  • [43] An active semi-supervised deep learning model for human activity recognition
    Haixia Bi
    Miquel Perello-Nieto
    Raul Santos-Rodriguez
    Peter Flach
    Ian Craddock
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 13049 - 13065
  • [44] A Graph Based Subspace Semi-supervised Learning Framework for Dimensionality Reduction
    Yang, Wuyi
    Zhang, Shuwu
    Liang, Wei
    COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 664 - 677
  • [45] Semi-supervised Clustering Framework Based on Active Learning for Real Data
    Odate, Ryosuke
    Shinjo, Hiroshi
    Suzuki, Yasufumi
    Motobayashi, Masahiro
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 184 - 193
  • [46] An active learning framework for semi-supervised document clustering with language modeling
    Huang, Ruizhang
    Lam, Wai
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (01) : 49 - 67
  • [47] Instance segmentation using semi-supervised learning for fire recognition
    Sun, Guangmin
    Wen, Yuxuan
    Li, Yu
    HELIYON, 2022, 8 (12)
  • [48] Control chart pattern recognition using semi-supervised learning
    Yang, Miin-Shen
    Yang, Jenn-Hwai
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE: COMPUTER SCIENCE CHALLENGES, 2007, : 272 - +
  • [49] Recognition and Classifying Sales Flyers Using Semi-Supervised Learning
    Mosquera, Harlinton Palacios
    Genc, Yakup
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 382 - 387
  • [50] Combining Active Learning and Semi-supervised Learning by Using Selective Label Spreading
    Chen, Xu
    Wang, Tao
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 850 - 857