Lifelong Learning With Cycle Memory Networks

被引：15

作者：

Peng, Jian ^{[1
,2
]}

Ye, Dingqi ^{[2
,3
]}

Tang, Bo ^{[4
]}

Lei, Yinjie ^{[5
]}

Liu, Yu ^{[6
]}

Li, Haifeng ^{[2
,3
]}

机构：

[1] Tsinghua Univ, Dept Precis Instrument, Beijing 100084, Peoples R China

[2] Xiangjiang Lab, Changsha 410205, Peoples R China

[3] Cent South Univ, Sch Geosci & Infophys, Changsha 410083, Peoples R China

[4] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA

[5] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610017, Peoples R China

[6] Peking Univ, Sch Earth & Space Sci, Beijing 100871, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 35卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Anterograde forgetting; catastrophic forgetting; complementary learning theory; cycle memory network (CMN); lifelong learning;

D O I：

10.1109/TNNLS.2023.3294495

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning from a sequence of tasks for a lifetime is essential for an agent toward artificial general intelligence. Despite the explosion of this research field in recent years, most work focuses on the well-known catastrophic forgetting issue. In contrast, this work aims to explore knowledge-transferable lifelong learning without storing historical data and significant additional computational overhead. We demonstrate that existing data-free frameworks, including regularization-based single-network and structure-based multinetwork frameworks, face a fundamental issue of lifelong learning, named anterograde forgetting, i.e., preserving and transferring memory may inhibit the learning of new knowledge. We attribute it to the fact that the learning network capacity decreases while memorizing historical knowledge and conceptual confusion between the irrelevant old knowledge and the current task. Inspired by the complementary learning theory in neuroscience, we endow artificial neural networks with the ability to continuously learn without forgetting while recalling historical knowledge to facilitate learning new knowledge. Specifically, this work proposes a general framework named cycle memory networks (CMNs). The CMN consists of two individual memory networks to store short- and long-term memories separately to avoid capacity shrinkage and a transfer cell between them. It enables knowledge transfer from the long-term to the short-term memory network to mitigate conceptual confusion. In addition, the memory consolidation mechanism integrates short-term knowledge into the long-term memory network for knowledge accumulation. We demonstrate that the CMN can effectively address the anterograde forgetting on several task-related, task-conflict, class-incremental, and crossdomain benchmarks. Furthermore, we provide extensive ablation studies to verify each framework component. The source codes are available at: https://github.com/GeoX-Lab/CMN.

引用

页码：1 / 14

页数：14

共 50 条

[1] Lifelong Learning With Cycle Memory Networks
Peng, Jian
Ye, Dingqi
Tang, Bo
Lei, Yinjie
Liu, Yu
Li, Haifeng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16439 - 16452
[2] Lifelong Learning Memory Networks for Aspect Sentiment Classification
Wang, Shuai
Lv, Guangyi
Mazumder, Sahisnu
Fei, Geli
Liu, Bing
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 861 - 870
[3] Generative Memory for Lifelong Learning
Su, Xin
Guo, Shangqi
Tan, Tian
Chen, Feng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (06) : 1884 - 1898
[4] Lifelong Learning Networks: Beyond Single Agent Lifelong Learning
Rostami, Mohammad
Eaton, Eric
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8145 - 8146
[5] Episodic Memory in Lifelong Language Learning
d'Autume, Cyprien de Masson
Ruder, Sebastian
Kong, Lingpeng
Yogatama, Dani
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[6] Learning Networks for Lifelong Competence Development
Stefanov, Krassen
Koper, Rob
INTERACTIVE LEARNING ENVIRONMENTS, 2007, 15 (02) : 101 - 105
[7] Lifelong Learning in Artificial Neural Networks
Anthes, Gary
COMMUNICATIONS OF THE ACM, 2019, 62 (06) : 13 - 15
[8] A design model for lifelong learning networks
Koper, R
Giesbers, B
van Rosmalen, P
Sloep, P
van Bruggen, J
Tattersall, C
Vogten, H
Brouns, F
INTERACTIVE LEARNING ENVIRONMENTS, 2005, 13 (1-2) : 71 - 92
[9] The Lifelong Learning in the University: Learning Networks and Knowledge Transferring
Ardimento, Pasquale
Boffoli, Nicola
Convertini, Vito Nicola
Visaggio, Giuseppe
JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2011, 7 (01): : 21 - +
[10] Lifelong representation learning in dynamic attributed networks
Wei, Hao
Hu, Guyu
Bai, Wei
Xia, Shiming
Pan, Zhisong
NEUROCOMPUTING, 2019, 358 : 1 - 9

← 1 2 3 4 5 →