Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

被引:0
|
作者
Sung, Mujeen [1 ]
Gun, James [2 ]
Mansimov, Elman [2 ]
Pappas, Nikolaos [2 ]
Shu, Raphael [2 ]
Romeo, Salvatore [2 ]
Zhang, Yi [2 ]
Castelli, Vittorio [2 ]
机构
[1] Korea Univ, Seoul, South Korea
[2] AWS AI Labs, New York, NY USA
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intent classification (IC) plays an important role in task-oriented dialogue systems. However, IC models often generalize poorly when training without sufficient annotated examples for each user intent. We propose a novel pre-training method for text encoders that uses contrastive learning with intent psuedo-labels to produce embeddings that are well-suited for IC tasks, reducing the need for manual annotations. By applying this pre-training strategy, we also introduce Pre-trained Intent-aware Encoder (PIE), which is designed to align encodings of utterances with their intent names. Specifically, we first train a tagger to identify key phrases within utterances that are crucial for interpreting intents. We then use these extracted phrases to create examples for pre-training a text encoder in a contrastive manner. As a result, our PIE model achieves up to 5.4% and 4.0% higher accuracy than the previous state-of-the-art text encoder for the N-way zero- and one-shot settings on four IC datasets.
引用
收藏
页码:10433 / 10442
页数:10
相关论文
共 50 条
  • [1] Effectiveness of Pre-training for Few-shot Intent Classification
    Zhang, Haode
    Zhang, Yuwei
    Zhan, Li-Ming
    Chen, Jiaxin
    Shi, Guangyuan
    Wu, Xiao-Ming
    Lam, Albert Y. S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1114 - 1120
  • [2] Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning
    Zhang, Jian-Guo
    Bui, Trung
    Yoon, Seunghyun
    Chen, Xiang
    Liu, Zhiwei
    Xia, Congying
    Tran, Quan Hung
    Chang, Walter
    Yu, Philip
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1906 - 1912
  • [3] Label Semantic Aware Pre-training for Few-shot Text Classification
    Mueller, Aaron
    Krone, Jason
    Romeo, Salvatore
    Mansour, Saab
    Mansimov, Elman
    Zhang, Yi
    Roth, Dan
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8318 - 8334
  • [4] Revisit Few-shot Intent Classification with PLMs: Direct Fine-tuning vs. Continual Pre-training
    Zhang, Haode
    Liang, Haowen
    Zh, Liming
    Lam, Albert Y. S.
    Wu, Xiao-Ming
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11105 - 11119
  • [5] The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification
    Mesgar, Mohsen
    Thy Thy Tran
    Glavas, Goran
    Gurevych, Iryna
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1846 - 1857
  • [6] Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization
    Zhang, Haode
    Liang, Haowen
    Zhang, Yuwei
    Zhan, Liming
    Wu, Xiao-Ming
    Lu, Xiaolei
    Lam, Albert Y. S.
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 532 - 542
  • [7] WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation
    Jeong, Jongheon
    Zou, Yang
    Kim, Taewan
    Zhang, Dongqing
    Ravichandran, Avinash
    Dabeer, Onkar
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19606 - 19616
  • [8] Few-shot Intent Classification and Slot Filling with Retrieved Examples
    Yu, Dian
    He, Luheng
    Zhang, Yuan
    Du, Xinya
    Pasupat, Panupong
    Li, Qi
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 734 - 749
  • [9] PROTODA: EFFICIENT TRANSFER LEARNING FOR FEW-SHOT INTENT CLASSIFICATION
    Kumar, Manoj
    Kumar, Varun
    Glaude, Hadrien
    Delichy, Cyprien
    Alok, Aman
    Gupta, Rahul
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 966 - 972
  • [10] Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling
    Han, Jie
    Zou, Yixiong
    Wang, Haozhao
    Wang, Jun
    Liu, Wei
    Wu, Yao
    Zhang, Tao
    Li, Ruixuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18171 - 18179