A Study on Catastrophic Forgetting in Deep LSTM Networks

被引:17
|
作者
Schak, Monika [1 ]
Gepperth, Alexander [1 ]
机构
[1] Univ Appl Sci Fulda, D-36037 Fulda, Germany
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II | 2019年 / 11728卷
关键词
LSTM; Catastrophic Forgetting;
D O I
10.1007/978-3-030-30484-3_56
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a systematic study of Catastrophic Forgetting (CF), i.e., the abrupt loss of previously acquired knowledge, when retraining deep recurrent LSTM networks with new samples. CF has recently received renewed attention in the case of feed-forward DNNs, and this article is the first work that aims to rigorously establish whether deep LSTM networks are afflicted by CF as well, and to what degree. In order to test this fully, training is conducted using a wide variety of high-dimensional image-based sequence classification tasks derived from established visual classification benchmarks (MNIST, Devanagari, FashionMNIST and EMNIST). We find that the CF effect occurs universally, without exception, for deep LSTM-based sequence classifiers, regardless of the construction and provenance of sequences. This leads us to conclude that LSTMs, just like DNNs, are fully affected by CF, and that further research work needs to be conducted in order to determine how to avoid this effect (which is not a goal of this study).
引用
收藏
页码:714 / 728
页数:15
相关论文
共 50 条
  • [1] Catastrophic Forgetting in Deep Graph Networks: A Graph Classification Benchmark
    Carta, Antonio
    Cossu, Andrea
    Errica, Federico
    Bacciu, Davide
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [2] Catastrophic forgetting in connectionist networks
    French, RM
    TRENDS IN COGNITIVE SCIENCES, 1999, 3 (04) : 128 - 135
  • [3] Overcoming catastrophic forgetting in neural networks
    Kirkpatricka, James
    Pascanu, Razvan
    Rabinowitz, Neil
    Veness, Joel
    Desjardins, Guillaume
    Rusu, Andrei A.
    Milan, Kieran
    Quan, John
    Ramalho, Tiago
    Grabska-Barwinska, Agnieszka
    Hassabis, Demis
    Clopath, Claudia
    Kumaran, Dharshan
    Hadsell, Raia
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526
  • [4] Measuring Catastrophic Forgetting in Neural Networks
    Kemker, Ronald
    McClure, Marc
    Abitino, Angelina
    Hayes, Tyler L.
    Kanan, Christopher
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3390 - 3398
  • [5] Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy
    Aleixo, Everton Lima
    Colonna, Juan G.
    Cristo, Marco
    Fernandes, Everlandio
    Journal of the Brazilian Computer Society, 2024, 30 (01) : 175 - 211
  • [6] Overcoming Catastrophic Forgetting in Graph Neural Networks
    Liu, Huihui
    Yang, Yiding
    Wang, Xinchao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8653 - 8661
  • [7] Generalisable deep Learning framework to overcome catastrophic forgetting
    Alammar, Zaenab
    Alzubaidi, Laith
    Zhang, Jinglan
    Li, Yuefeng
    Gupta, Ashish
    Gu, Yuantong
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [8] Catastrophic forgetting in simple networks: an analysis of the pseudorehearsal solution
    Frean, M
    Robins, A
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1999, 10 (03) : 227 - 236
  • [9] Generalization and catastrophic forgetting in radial basis function networks
    Middleton, N
    PROCEEDINGS OF THE NINETEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1997, : 993 - 993
  • [10] Unsupervised Learning to Overcome Catastrophic Forgetting in Neural Networks
    Munoz-Martin, Irene
    Bianchi, Stefano
    Pedretti, Giacomo
    Melnic, Octavian
    Ambrogio, Stefano
    Ielmini, Daniele
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2019, 5 (01): : 58 - 66