Combining active and semi-supervised learning for spoken language understanding

被引:136
|
作者
Tur, G
Hakkani-Tür, D
Schapire, RE
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
关键词
active learning; semi-supervised learning; spoken language understanding; call classification;
D O I
10.1016/j.specom.2004.08.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is labor intensive and time consuming. Active learning aims to minimize the number of labeled utterances by automatically selecting the utterances that are likely to be most informative for labeling. The method for active learning we propose, inspired by certainty-based active learning, selects the examples that the classifier is the least confident about. The examples that are classified with higher confidence scores (hence not selected by active learning) are exploited using two semi-supervised learning methods. The first method augments the training data by using the machine-labeled classes for the unlabeled utterances. The second method instead augments the classification model trained using the human-labeled utterances with the machine-labeled ones in a weighted manner. We then combine active and semi-supervised learning using selectively sampled and automatically labeled data. This enables us to exploit all collected data and alleviates the data imbalance problem caused by employing only active or semi-supervised learning. We have evaluated these active and semi-supervised learning methods with a call classification system used for AT&T customer care. Our results indicate that it is possible to reduce human labeling effort significantly. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 50 条
  • [1] Semi-supervised learning for spoken language understanding using semantic role labeling
    Tur, G
    Hakkani-Tür, D
    Chotimongkol, A
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 232 - 237
  • [2] The impact of intent distribution mismatch on semi-supervised spoken language understanding
    Gaspers, Judith
    Quynh Do
    Sorokin, Daniil
    Lehnen, Patrick
    [J]. INTERSPEECH 2021, 2021, : 4708 - 4712
  • [3] SEMI-SUPERVISED TRAINING USING ADVERSARIAL MULTI-TASK LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING
    Lan, Ouyu
    Zhu, Su
    Yu, Kai
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6049 - 6053
  • [4] Dual Learning for Semi-Supervised Natural Language Understanding
    Zhu, Su
    Cao, Ruisheng
    Yu, Kai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1936 - 1947
  • [5] SEMI-SUPERVISED SPOKEN LANGUAGE UNDERSTANDING VIA SELF-SUPERVISED SPEECH AND LANGUAGE MODEL PRETRAINING
    Lai, Cheng-, I
    Chuang, Yung-Sung
    Lee, Hung-Yi
    Li, Shang-Wen
    Glass, James
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7468 - 7472
  • [6] Network Security Monitoring by Combining Semi-Supervised Learning and Active Learning
    Pan, Yun
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2022, 13 (02)
  • [7] A Semi-supervised Method for Efficient Construction of Statistical Spoken Language Understanding Resources
    Kim, Seokhwan
    Jeong, Minwoo
    Lee, Gary Geunbae
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 977 - 980
  • [8] Semi-supervised learning combining co-training with active learning
    Zhang, Yihao
    Wen, Junhao
    Wang, Xibin
    Jiang, Zhuo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2372 - 2378
  • [9] Combining active learning and semi-supervised learning to construct SVM classifier
    Leng, Yan
    Xu, Xinyan
    Qi, Guanghui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 44 : 121 - 131
  • [10] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Hady, Mohamed Farouk Abdel
    Schwenker, Friedhelm
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (04): : 681 - 698