Combining active and semi-supervised learning for spoken language understanding

被引:136
|
作者
Tur, G
Hakkani-Tür, D
Schapire, RE
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
关键词
active learning; semi-supervised learning; spoken language understanding; call classification;
D O I
10.1016/j.specom.2004.08.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is labor intensive and time consuming. Active learning aims to minimize the number of labeled utterances by automatically selecting the utterances that are likely to be most informative for labeling. The method for active learning we propose, inspired by certainty-based active learning, selects the examples that the classifier is the least confident about. The examples that are classified with higher confidence scores (hence not selected by active learning) are exploited using two semi-supervised learning methods. The first method augments the training data by using the machine-labeled classes for the unlabeled utterances. The second method instead augments the classification model trained using the human-labeled utterances with the machine-labeled ones in a weighted manner. We then combine active and semi-supervised learning using selectively sampled and automatically labeled data. This enables us to exploit all collected data and alleviates the data imbalance problem caused by employing only active or semi-supervised learning. We have evaluated these active and semi-supervised learning methods with a call classification system used for AT&T customer care. Our results indicate that it is possible to reduce human labeling effort significantly. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 50 条
  • [11] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    [J]. Journal of Computer Science and Technology, 2010, 25 : 681 - 698
  • [12] Combining Semi-Supervised and Active Learning for Hyperspectral Image Classification
    Li, Mingzhi
    Wang, Rui
    Tang, Ke
    [J]. 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2013, : 89 - 94
  • [13] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    [J]. Journal of Computer Science & Technology, 2010, 25 (04) : 681 - 698
  • [14] A semi-supervised learning method for semantic modeling in language understanding
    Ortega, L.
    Galiano, I.
    Hurtado, L. F.
    Sanchis, E.
    Segarra, E.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 199 - 205
  • [15] Semi-Supervised Learning of Statistical Models for Natural Language Understanding
    Zhou, Deyu
    He, Yulan
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [16] Industry Scale Semi-Supervised Learning for Natural Language Understanding
    Chen, Luoxin
    Garcia, Francisco
    Kumar, Varun
    Xie, He
    Lu, Jianhua
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 311 - 318
  • [17] Combining Active Learning and Semi-supervised Learning by Using Selective Label Spreading
    Chen, Xu
    Wang, Tao
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 850 - 857
  • [18] Semi-supervised learning combining transductive support vector machine with active learning
    Lu, Boli
    Wang, Xibin
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 31 - 40
  • [19] Combining Active Learning and Semi-supervised Learning Using Local and Global Consistency
    Gu, Yingjie
    Jin, Zhong
    Chiu, Steve C.
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2014), PT I, 2014, 8834 : 215 - 222
  • [20] Semi-supervised learning combining transductive support vector machine with active learning
    Wang, Xibin
    Wen, Junhao
    Alam, Shafiq
    Jiang, Zhuo
    Wu, Yingbo
    [J]. NEUROCOMPUTING, 2016, 173 : 1288 - 1298