Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

Cited by: 0
Authors
Sung, Mujeen [1]
Gun, James [2]
Mansimov, Elman [2]
Pappas, Nikolaos [2]
Shu, Raphael [2]
Romeo, Salvatore [2]
Zhang, Yi [2]
Castelli, Vittorio [2]
Affiliations
[1] Korea Univ, Seoul, South Korea
[2] AWS AI Labs, New York, NY USA
Funding
National Research Foundation, Singapore
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Intent classification (IC) plays an important role in task-oriented dialogue systems. However, IC models often generalize poorly when trained without sufficient annotated examples for each user intent. We propose a novel pre-training method for text encoders that uses contrastive learning with intent pseudo-labels to produce embeddings that are well-suited for IC tasks, reducing the need for manual annotations. By applying this pre-training strategy, we also introduce the Pre-trained Intent-aware Encoder (PIE), which is designed to align encodings of utterances with their intent names. Specifically, we first train a tagger to identify key phrases within utterances that are crucial for interpreting intents. We then use these extracted phrases to create examples for pre-training a text encoder in a contrastive manner. As a result, our PIE model achieves up to 5.4% and 4.0% higher accuracy than the previous state-of-the-art text encoder for the N-way zero- and one-shot settings on four IC datasets.
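The idea of aligning utterance encodings with intent names can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the exact loss, so an InfoNCE-style in-batch objective is assumed here, the temperature value is arbitrary, the three-dimensional vectors are hand-made stand-ins for encoder outputs, and the names `cosine`, `info_nce`, and `predict_intent` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def info_nce(utterance_vecs, intent_vecs, temperature=0.1):
    """Mean in-batch contrastive loss (assumed objective, not from the paper).

    Utterance i is treated as the positive pair of intent i; every other
    intent in the batch acts as a negative.
    """
    total = 0.0
    for i, u in enumerate(utterance_vecs):
        logits = [cosine(u, v) / temperature for v in intent_vecs]
        peak = max(logits)  # subtract the max for numerical stability
        log_z = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_z - logits[i]
    return total / len(utterance_vecs)


def predict_intent(utterance_vec, intent_name_vecs):
    """Zero-shot prediction: pick the intent whose name embedding is closest."""
    return max(
        intent_name_vecs,
        key=lambda name: cosine(utterance_vec, intent_name_vecs[name]),
    )


# Toy embeddings standing in for encoder outputs (illustrative values only).
intent_name_vecs = {
    "book_flight": [1.0, 0.0, 0.0],
    "play_music": [0.0, 1.0, 0.0],
}
utterance_vecs = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0]]

aligned_loss = info_nce(utterance_vecs, list(intent_name_vecs.values()))
swapped_loss = info_nce(utterance_vecs, list(intent_name_vecs.values())[::-1])
prediction = predict_intent(utterance_vecs[0], intent_name_vecs)
```

With these toy vectors, the loss is lower when each utterance is paired with its own intent than when the pairing is swapped, which is the signal a contrastive pre-training step would optimize. At inference time no labeled utterances are needed; the intent names alone serve as class prototypes.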
Pages: 10433-10442
Page count: 10