Few-shot intent detection with self-supervised pretraining and prototype-aware attention

被引:0
|
作者
Yang, Shun [1 ]
Du, YaJun [1 ]
Zheng, Xin [1 ]
Li, XianYong [1 ]
Chen, XiaoLiang [1 ]
Li, YanLi [1 ]
Xie, ChunZhi [1 ]
机构
[1] Xihua Univ, Sch Comp & Software Engn, Chengdu 610065, Sichuan, Peoples R China
关键词
Few-shot learning; Intent detection; Prototype generation; Self-supervised learning; Contrastive learning;
D O I
10.1016/j.patcog.2024.110641
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few -shot intent detection is a more challenging application. However, traditional prototypical networks based on averaging often suffer from issues such as missing key information, poor generalization capabilities. In previous work, using three-dimensional convolutional neural networks (3DCNN) to generate prototype representations faces challenges with long-distance dependencies. Furthermore, a pretrained encoder's performance in a specific domain is often suboptimal because its knowledge of the specific domain is fragmented. Therefore, in this paper, we propose a simple yet effective two -stage learning strategy to address these issues. In the first stage, we propose a self -supervised multi -task pretraining (SMTP) strategy. SMTP utilizes unlabeled data from the current domain to help the pretrained encoder learn the semantic information of the text and implicitly distinguish semantically similar text representations without using any labels. SMTP aims to enhance the representation capability of the pretrained encoder in a specific domain. In the second stage, we propose a prototype -aware attention (PaAT) model to generate prototype representations of the same class. PaAT generates prototype representations by calculating the attention between class texts, which can effectively solve the long-distance dependence problem of 3DCNN. PaAT is a siamese architecture that can simultaneously generate prototype representations and sentence -level representations of unseen data. In addition, to prevent overfitting in few -shot learning, we introduce an unsupervised contrastive regularization term to constrain PaAT. Our method achieves state-of-the-art performance on four public datasets. 1
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Self-Supervised Task Augmentation for Few-Shot Intent Detection
    Peng-Fei Sun
    Ya-Wen Ouyang
    Ding-Jie Song
    Xin-Yu Dai
    [J]. Journal of Computer Science and Technology, 2022, 37 : 527 - 538
  • [2] Self-Supervised Task Augmentation for Few-Shot Intent Detection
    Sun, Peng-Fei
    Ouyang, Ya-Wen
    Song, Ding-Jie
    Dai, Xin-Yu
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (03) : 527 - 538
  • [3] Self-supervised Prototype Conditional Few-Shot Object Detection
    Kobayashi, Daisuke
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 681 - 692
  • [4] A Survey of Self-Supervised and Few-Shot Object Detection
    Huang, Gabriel
    Laradji, Issam
    Vazquez, David
    Lacoste-Julien, Simon
    Rodriguez, Pau
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4071 - 4089
  • [5] Multi-task Self-supervised Few-Shot Detection
    Zhang, Guangyong
    Duan, Lijuan
    Wang, Wenjian
    Gong, Zhi
    Ma, Bian
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 107 - 119
  • [6] Self-Supervised Tuning for Few-Shot Segmentation
    Zhu, Kai
    Zhai, Wei
    Cao, Yang
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1019 - 1025
  • [7] A Novel Self-supervised Few-shot Network Intrusion Detection Method
    Zhang, Jing
    Shi, Zhixin
    Wu, Hao
    Xing, Mengyan
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT I, 2022, 13471 : 513 - 525
  • [8] FLUID: Few-Shot Self-Supervised Image Deraining
    Rai, Shyam Nandan
    Saluja, Rohit
    Arora, Chetan
    Balasubramanian, Vineeth N.
    Subramanian, Anbumani
    Jawahar, C., V
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 418 - 427
  • [9] Self-Supervised Few-Shot Learning on Point Clouds
    Sharma, Charu
    Kaul, Manohar
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [10] Conditional Self-Supervised Learning for Few-Shot Classification
    An, Yuexuan
    Xue, Hui
    Zhao, Xingyu
    Zhang, Lu
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2140 - 2146