Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Authors
Qin, Chengwei [1]
Chen, Chen [1]
Joty, Shafiq [1, 2]
Affiliations
[1] Nanyang Technological University, Singapore, Singapore
[2] Salesforce AI, Singapore, Singapore
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Lifelong sequence generation (LSG), a problem in continual learning, aims to continually train a model on a sequence of generation tasks to learn constantly emerging new generation patterns while avoiding the forgetting of previous knowledge. Existing LSG methods mainly focus on maintaining old knowledge while paying little attention to knowledge transfer across tasks. In contrast, humans can better learn new tasks by leveraging previously acquired knowledge from similar tasks. Inspired by the learning paradigm of humans, we propose Dynamic Module Expansion and Adaptation (DMEA), which enables the model to dynamically determine the architecture for acquiring new knowledge based on task correlation and select the most similar previous tasks to facilitate adaptation to new tasks. In addition, as the learning process can easily be biased towards the current task, which might cause more severe forgetting of previously learned knowledge, we propose dynamic gradient scaling to balance the learning of the current task and replayed tasks. With extensive experiments, we demonstrate that DMEA can consistently outperform existing methods in different LSG settings.
Pages: 6701-6714
Page count: 14