Exact learning dynamics of deep linear networks with prior knowledge

被引:0
|
作者
Braun, Lukas [1 ]
Domine, Clementine C. J. [2 ]
Fitzgerald, James E. [3 ]
Saxe, Andrew M. [2 ,4 ,5 ]
机构
[1] Univ Oxford, Dept Expt Psychol, Oxford, England
[2] UCL, Gatsby Computat Neurosci Unit, London, England
[3] Janelia Res Campus, Howard Hughes Med Inst, Ashburn, VA USA
[4] UCL, Sainsbury Wellcome Ctr, London, England
[5] CIFAR, Toronto, ON, Canada
基金
英国惠康基金; 英国医学研究理事会;
关键词
CONNECTIONIST MODELS; NEURAL-NETWORKS; SYSTEMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning in deep neural networks is known to depend critically on the knowledge embedded in the initial network weights. However, few theoretical results have precisely linked prior knowledge to learning dynamics. Here we derive exact solutions to the dynamics of learning with rich prior knowledge in deep linear networks by generalising Fukumizu's matrix Riccati solution [1]. We obtain explicit expressions for the evolving network function, hidden representational similarity, and neural tangent kernel over training for a broad class of initialisations and tasks. The expressions reveal a class of task-independent initialisations that radically alter learning dynamics from slow non-linear dynamics to fast exponential trajectories while converging to a global optimum with identical representational similarity, dissociating learning trajectories from the structure of initial internal representations. We characterise how network weights dynamically align with task structure, rigorously justifying why previous solutions successfully described learning from small initial weights without incorporating their fine-scale structure. Finally, we discuss the implications of these findings for continual learning, reversal learning and learning of structured knowledge. Taken together, our results provide a mathematical toolkit for understanding the impact of prior knowledge on deep learning.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Image enhancement for fluorescence microscopy based on deep learning with prior knowledge of aberration
    Hu, Lejia
    Hu, Shuwen
    Gong, Wei
    Si, Ke
    OPTICS LETTERS, 2021, 46 (09) : 2055 - 2058
  • [42] Fast training method of deep-learning model fused with prior knowledge
    Wang P.
    He M.
    Wang H.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2021, 42 (04): : 561 - 566
  • [43] Path planning in an unknown environment based on deep reinforcement learning with prior knowledge
    Lou, Ping
    Xu, Kun
    Jiang, Xuemei
    Xiao, Zheng
    Yan, Junwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5773 - 5789
  • [44] Deep Human Dynamics Prior
    Cui, Qiongjie
    Sun, Huaijiang
    Kong, Yue
    Sun, Xiaoning
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4371 - 4379
  • [45] Incorporating Prior Scientific Knowledge Into Deep Learning for Precipitation Nowcasting on Radar Images
    Danpoonkij, Pattarapong
    Kleawsirikul, Nutnaree
    Leepaisomboon, Patamawadee
    Gaviphatt, Natnapat
    Sakaino, Hidetomo
    Vateekul, Peerapon
    2021 18TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE-2021), 2021,
  • [46] Asymptotic Expansion as Prior Knowledge in Deep Learning Method for High dimensional BSDEs
    Masaaki Fujii
    Akihiko Takahashi
    Masayuki Takahashi
    Asia-Pacific Financial Markets, 2019, 26 : 391 - 408
  • [47] Knowledge Distillation with Attention for Deep Transfer Learning of Convolutional Networks
    Li, Xingjian
    Xiong, Haoyi
    Chen, Zeyu
    Huan, Jun
    Liu, Ji
    Xu, Cheng-Zhong
    Dou, Dejing
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (03)
  • [48] Spiking neural networks for deep learning and knowledge representation: Editorial
    Kasabov, Nikola K.
    NEURAL NETWORKS, 2019, 119 : 341 - 342
  • [49] Knowledge graph learning algorithm based on deep convolutional networks
    Zhou, Yuzhong
    Lin, Zhengping
    Lin, Jie
    Yang, Yuliang
    Shi, Jiahao
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
  • [50] Penalized PET Reconstruction Using Deep Learning Prior and Local Linear Fitting
    Kim, Kyungsang
    Wu, Dufan
    Gong, Kuang
    Dutta, Joyita
    Kim, Jong Hoon
    Son, Young Don
    Kim, Hang Keun
    El Fakhri, Georges
    Li, Quanzheng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (06) : 1478 - 1487