InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective

被引:0
|
作者
Song, Yifan [1 ]
Wang, Peiyi [1 ]
Xiong, Weimin [1 ]
Zhu, Dawei [1 ]
Liu, Tianyu [2 ]
Sui, Zhifang [1 ]
Li, Sujian [1 ]
机构
[1] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023) | 2023年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual learning (CL) aims to constantly learn new knowledge over time while avoiding catastrophic forgetting on old tasks. We focus on continual text classification under the class-incremental setting. Recent CL studies have identified the severe performance decrease on analogous classes as a key factor for catastrophic forgetting. In this paper, through an in-depth exploration of the representation learning process in CL, we discover that the compression effect of the information bottleneck leads to confusion on analogous classes. To enable the model learn more sufficient representations, we propose a novel replay-based continual text classification method, InfoCL. Our approach utilizes fast-slow and current-past contrastive learning to perform mutual information maximization and better recover the previously learned representations. In addition, InfoCL incorporates an adversarial memory augmentation strategy to alleviate the overfitting problem of replay. Experimental results demonstrate that InfoCL effectively mitigates forgetting and achieves state-of-the-art performance on three text classification tasks. The code is publicly available at https://github. com/Yifan-Song793/InfoCL.
引用
收藏
页码:14557 / 14570
页数:14
相关论文
共 45 条
  • [41] Enhanced Information Retrieval from Narrative German-language Clinical Text Documents using Automated Document Classification
    Spat, Stephan
    Cadonna, Bruno
    Rakovac, Ivo
    Guetl, Christian
    Leitner, Hubert
    Stark, Guenther
    Beck, Peter
    EHEALTH BEYOND THE HORIZON - GET IT THERE, 2008, 136 : 473 - +
  • [42] Policy text quantification of urban waste classification from the perspective of policy tools: Based on the analysis of 6 cities in Northwest China
    FAN Xiao-cao
    Ecological Economy, 2024, 20 (04) : 350 - 371
  • [43] Personal information management on social media from the perspective of platform support: a text analysis based on the Chinese social media platform policy
    Zhou, Wenhong
    Dai, Linxu
    Zhang, Yujie
    Wen, Chuanling
    ONLINE INFORMATION REVIEW, 2022, 46 (01) : 1 - 21
  • [44] POOR-DATA, LOW-INFORMATION AND BLACK KNOWLEDGE IN CONTEXT OF GREY SYSTEMS. AN ANALYSIS FROM THE PERSPECTIVE OF UNCERTAINTY IN MULTINOMIAL CLASSIFICATION
    Scarlat, Emil
    Maracine, Virginia
    Barbul, Andrada
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY (IE 2017): EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2017, : 448 - 456
  • [45] Representing and organizing information to describe the lived experience of health from a personal factors perspective in the light of the International Classification of Functioning, Disability and Health (ICF): a discussion paper
    Geyh, Szilvia
    Schwegler, Urban
    Peter, Claudio
    Mueller, Rachel
    DISABILITY AND REHABILITATION, 2019, 41 (14) : 1727 - 1738