Recent Advances of Foundation Language Models-based Continual Learning: A Survey

被引:0
|
作者
Yang, Yutao [1 ]
Zhou, Jie [1 ]
Ding, Xuan wen [1 ]
Huai, Tianyu [1 ]
Liu, Shunyu [1 ]
Chen, Qin [1 ]
Xie, Yuan [1 ]
He, Liang [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
关键词
Continual learning; foundation language models; pre-trained language models; large language models; vision-language models; survey; NEURAL-NETWORKS; LIFELONG;
D O I
10.1145/3705725
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, foundation language models (LMs) have marked significant achievements in the domains of natural language processing and computer vision. Unlike traditional neural network models, foundation LMs obtain a great ability for transfer learning by acquiring rich common sense knowledge through pre-training on extensive unsupervised datasets with a vast number of parameters. Despite these capabilities, LMs still struggle with catastrophic forgetting, hindering their ability to learn continuously like humans. To address this, continual learning (CL) methodologies have been introduced, allowing LMs to adapt to new tasks while retaining learned knowledge. However, a systematic taxonomy of existing approaches and a comparison of their performance are still lacking. In this article, we delve into a comprehensive review, summarization, and classification of the existing literature on CL-based approaches applied to foundation language models, such as pre-trained language models, large language models, and vision-language models. We divide these studies into offline and online CL, which consist of traditional methods, parameter-efficient-based methods, instruction tuning-based methods and continual pre-training methods. Additionally, we outline the typical datasets and metrics employed in CL research and provide a detailed analysis of the challenges and future work for LMs-based continual learning.
引用
收藏
页数:38
相关论文
共 50 条
  • [21] Malicious Models-based Federated Learning in Fog Computing Networks
    Huang, Xiaoge
    Ren, Yang
    He, Yong
    Chen, Qianbin
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 192 - 196
  • [22] A Survey of Recent Advances in Deep Learning Models for Detecting Malware in Desktop and Mobile Platforms
    Maniriho, Pascal
    Mahmood, Abdun Naser
    Chowdhury, Mohammad Jabed Morshed
    ACM COMPUTING SURVEYS, 2024, 56 (06)
  • [23] Recent advances on constraint-based models by integrating machine learning
    Rana, Pratip
    Berry, Carter
    Ghosh, Preetam
    Fong, Stephen S.
    CURRENT OPINION IN BIOTECHNOLOGY, 2020, 64 : 85 - 91
  • [24] Recent Advances in Intelligent Source Code Generation: A Survey on Natural Language Based Studies
    Yang, Chen
    Liu, Yan
    Yin, Changqing
    ENTROPY, 2021, 23 (09)
  • [25] Recent Advances in Conventional and Deep Learning-Based Depth Completion: A Survey
    Xie, Zexiao
    Yu, Xiaoxuan
    Gao, Xiang
    Li, Kunqian
    Shen, Shuhan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3395 - 3415
  • [26] A Survey on Some Recent Advances in Shared Memory Models
    Rajsbaum, Sergio
    Raynal, Michel
    STRUCTURAL INFORMATION AND COMMUNICATION COMPLEXITY, 2011, 6796 : 17 - 28
  • [27] A Survey on Recent Advances in Machine Learning Based Sleep Apnea Detection Systems
    Ramachandran, Anita
    Karuppiah, Anupama
    HEALTHCARE, 2021, 9 (07)
  • [28] Recent Advances on Generative Models for Semantic Segmentation: A Survey
    Bhurtel, Manish
    Rawat, Danda B.
    Rice, Daniel O.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [29] An advanced deep learning models-based plant disease detection: a review of recent research (vol 14, 1158933, 2023)
    Shoaib, Muhammad
    Shah, Babar
    EI-Sappagh, Shaker
    Ali, Akhtar
    Ullah, Asad
    Alenezi, Fayadh
    Gechev, Tsanko
    Hussain, Tariq
    Ali, Farman
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [30] Recent Advances of Deep Learning for Sign Language Recognition
    Zheng, Lihong
    Liang, Bin
    Jiang, Ailian
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 454 - 460