Revisiting Neural Networks for Continual Learning: An Architectural Perspective

被引:0
|
作者
Lu, Aojun [1 ]
Feng, Tao [3 ]
Yuan, Hangjie [2 ]
Song, Xiaotian [1 ]
Sun, Yanan [1 ]
机构
[1] Sichuan Univ, Chengdu, Sichuan, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024 | 2024年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efforts to overcome catastrophic forgetting have primarily centered around developing more effective Continual Learning (CL) methods. In contrast, less attention was devoted to analyzing the role of network architecture design (e.g., network depth, width, and components) in contributing to CL. This paper seeks to bridge this gap between network architecture design and CL, and to present a holistic study on the impact of network architectures on CL. This work considers architecture design at the network scaling level, i.e., width and depth, and also at the network components, i.e., skip connections, global pooling layers, and down-sampling. In both cases, we first derive insights through systematically exploring how architectural designs affect CL. Then, grounded in these insights, we craft a specialized search space for CL and further propose a simple yet effective ArchCraft method to steer a CLfriendly architecture, namely, this method recrafts AlexNet/ResNet into AlexAC/ResAC. Experimental validation across various CL settings and scenarios demonstrates that improved architectures are parameter-efficient, achieving state-of-the-art performance of CL while being 86%, 61%, and 97% more compact in terms of parameters than the naive CL architecture in Task IL and Class IL. Code is available at https://github.com/byyx666/ArchCraft.
引用
收藏
页码:4651 / 4659
页数:9
相关论文
共 50 条
  • [21] CONTINUAL LEARNING ON FACIAL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Feng, Jingjing
    Gomez, Valentina
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2023, 85 (03): : 239 - 248
  • [22] Continual learning of context-dependent processing in neural networks
    Zeng, Guanxiong
    Chen, Yang
    Cui, Bo
    Yu, Shan
    NATURE MACHINE INTELLIGENCE, 2019, 1 (08) : 364 - 372
  • [23] CONTINUAL LEARNING ON FACIAL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Feng, Jingjing
    Gomez, Valentina
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2023, 85 (03): : 239 - 248
  • [24] Continual Learning in Convolutional Neural Networks with Tensor Rank Updates
    Krol, Matt
    Hyder, Rakib
    Peechatt, Michael
    Prater-Bennette, Ashley
    Asif, M. Salman
    Markopoulos, Panos P.
    2024 IEEE 13RD SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP, SAM 2024, 2024,
  • [25] Hierarchical Indian Buffet Neural Networks for Bayesian Continual Learning
    Kessler, Samuel
    Vu Nguyen
    Zohren, Stefan
    Roberts, Stephen J.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 161, 2021, 161 : 749 - 759
  • [26] Continual learning of context-dependent processing in neural networks
    Guanxiong Zeng
    Yang Chen
    Bo Cui
    Shan Yu
    Nature Machine Intelligence, 2019, 1 : 364 - 372
  • [27] Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations
    Ororbia, Alexander
    Mali, Ankur
    Giles, C. Lee
    Kifer, Daniel
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4267 - 4278
  • [28] Efficient Spiking Neural Networks with Sparse Selective Activation for Continual Learning
    Shen, Jiangrong
    Ni, Wenyao
    Xu, Qi
    Tang, Huajin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 611 - 619
  • [29] Evolutionary FPGA-Based Spiking Neural Networks for Continual Learning
    Otero, Andres
    Sanllorente, Guillermo
    de la Torre, Eduardo
    Nunez-Yanez, Jose
    APPLIED RECONFIGURABLE COMPUTING. ARCHITECTURES, TOOLS, AND APPLICATIONS, ARC 2023, 2023, 14251 : 260 - 274
  • [30] Continual Learning with Deep Neural Networks in Physiological Signal Data: A Survey
    Li, Ao
    Li, Huayu
    Yuan, Geng
    HEALTHCARE, 2024, 12 (02)