Combining active and semi-supervised learning for spoken language understanding

被引:136
|
作者
Tur, G
Hakkani-Tür, D
Schapire, RE
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
关键词
active learning; semi-supervised learning; spoken language understanding; call classification;
D O I
10.1016/j.specom.2004.08.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is labor intensive and time consuming. Active learning aims to minimize the number of labeled utterances by automatically selecting the utterances that are likely to be most informative for labeling. The method for active learning we propose, inspired by certainty-based active learning, selects the examples that the classifier is the least confident about. The examples that are classified with higher confidence scores (hence not selected by active learning) are exploited using two semi-supervised learning methods. The first method augments the training data by using the machine-labeled classes for the unlabeled utterances. The second method instead augments the classification model trained using the human-labeled utterances with the machine-labeled ones in a weighted manner. We then combine active and semi-supervised learning using selectively sampled and automatically labeled data. This enables us to exploit all collected data and alleviates the data imbalance problem caused by employing only active or semi-supervised learning. We have evaluated these active and semi-supervised learning methods with a call classification system used for AT&T customer care. Our results indicate that it is possible to reduce human labeling effort significantly. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 50 条
  • [21] EFFICIENT SEMI-SUPERVISED LEARNING FOR NATURAL LANGUAGE UNDERSTANDING BY OPTIMIZING DIVERSITY
    Cho, Eunah
    Xie, He
    Lalor, John P.
    Kumar, Varun
    Campbell, William M.
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 1077 - 1084
  • [22] Analysis of active semi-supervised learning
    Berton, Lilian
    Mitsuishi, Felipe Baz
    Vega-Oliveros, Didier A.
    [J]. 38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1122 - 1129
  • [23] Combining smooth graphs with semi-supervised learning
    Liu, Liang
    Chen, Weijun
    Wang, Jianmin
    [J]. ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2007, 4505 : 329 - +
  • [24] Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models
    Drugman, Thomas
    Pylkkonen, Janne
    Kneser, Reinhard
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2318 - 2322
  • [25] An active learning framework for semi-supervised document clustering with language modeling
    Huang, Ruizhang
    Lam, Wai
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (01) : 49 - 67
  • [26] Classification of acoustical signals by combining active learning strategies with semi-supervised learning schemes
    Karlos, Stamatis
    Aridas, Christos
    Kanas, Vasileios G.
    Kotsiantis, Sotiris
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01): : 3 - 20
  • [27] Combining active learning and semi-supervised learning techniques to extract protein interaction sentences
    Song, Min
    Yu, Hwanjo
    Han, Wook-Shin
    [J]. BMC BIOINFORMATICS, 2011, 12 : S4
  • [28] Classification of acoustical signals by combining active learning strategies with semi-supervised learning schemes
    Stamatis Karlos
    Christos Aridas
    Vasileios G. Kanas
    Sotiris Kotsiantis
    [J]. Neural Computing and Applications, 2023, 35 : 3 - 20
  • [29] Combining active learning and semi-supervised learning techniques to extract protein interaction sentences
    Min Song
    Hwanjo Yu
    Wook-Shin Han
    [J]. BMC Bioinformatics, 12
  • [30] Combining semi-supervised and active learning to rank algorithms: application to Document Retrieval
    Faiza Dammak
    Hager Kammoun
    [J]. Information Retrieval Journal, 2021, 24 : 371 - 399