Induction Networks for Few-Shot Text Classification

被引:0
|
作者
Geng, Ruiying [1 ,2 ]
Li, Binhua [2 ]
Li, Yongbin [2 ]
Zhu, Xiaodan [3 ]
Jian, Ping [1 ]
Sun, Jian [2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Alibaba Grp, Beijing, Peoples R China
[3] Queens Univ, ECE, Kingston, ON, Canada
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation, by innovatively leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experiment results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, proving the effectiveness of class-wise generalization in few-shot text classification.
引用
收藏
页码:3904 / 3913
页数:10
相关论文
共 50 条
  • [1] Dynamic Memory Induction Networks for Few-Shot Text Classification
    Geng, Ruiying
    Li, Binhua
    Li, Yongbin
    Sun, Jian
    Zhu, Xiaodan
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1087 - 1094
  • [2] Hierarchical Attention Prototypical Networks for Few-Shot Text Classification
    Sun, Shengli
    Sun, Qingfeng
    Zhou, Kevin
    Lv, Tengchao
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 476 - 485
  • [3] Causal representation for few-shot text classification
    Yang, Maoqin
    Zhang, Xuejie
    Wang, Jin
    Zhou, Xiaobing
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21422 - 21432
  • [4] Few-shot learning for short text classification
    Yan, Leiming
    Zheng, Yuhui
    Cao, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29799 - 29810
  • [5] Adversarial training for few-shot text classification
    Croce, Danilo
    Castellucci, Giuseppe
    Basili, Roberto
    INTELLIGENZA ARTIFICIALE, 2020, 14 (02) : 201 - 214
  • [6] Few-shot learning for short text classification
    Leiming Yan
    Yuhui Zheng
    Jie Cao
    Multimedia Tools and Applications, 2018, 77 : 29799 - 29810
  • [7] Causal representation for few-shot text classification
    Maoqin Yang
    Xuejie Zhang
    Jin Wang
    Xiaobing Zhou
    Applied Intelligence, 2023, 53 : 21422 - 21432
  • [8] Continual Few-Shot Learning for Text Classification
    Pasunuru, Ramakanth
    Stoyanov, Veselin
    Bansal, Mohit
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5688 - 5702
  • [9] Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning
    Wei, Jason
    Huang, Chengyu
    Vosoughi, Soroush
    Cheng, Yu
    Xu, Shiqi
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5493 - 5500
  • [10] Few-Shot Text Classification with External Knowledge Expansion
    Guan, Jian
    Xu, Rui
    Ya, Jing
    Tang, Qiu
    Xue, Jidong
    Zhang, Ni
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 184 - 189