How does Weight Correlation Affect the Generalisation Ability of Deep Neural Networks?

被引:0
|
作者
Jin, Gaojie [1 ]
Yi, Xinping [1 ]
Zhang, Liang [2 ,3 ]
Zhang, Lijun [2 ,4 ]
Schewe, Sven [1 ]
Huang, Xiaowei [1 ]
机构
[1] Univ Liverpool, Liverpool, England
[2] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Inst Intellegence Software, Guangzhou, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability. For fully-connected layers, the weight correlation is defined as the average cosine similarity between weight vectors of neurons, and for convolutional layers, the weight correlation is defined as the cosine similarity between filter matrices. Theoretically, we show that, weight correlation can, and should, be incorporated into the PAC Bayesian framework for the generalisation of neural networks, and the resulting generalisation bound is monotonic with respect to the weight correlation. We formulate a new complexity measure, which lifts the PAC Bayes measure with weight correlation, and experimentally confirm that it is able to rank the generalisation errors of a set of networks more precisely than existing measures. More importantly, we develop a new regulariser for training, and provide extensive experiments that show that the generalisation error can be greatly reduced with our novel approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Improving the Generalisation Ability of Neural Networks Using a Lévy Flight Distribution Algorithm for Classification Problems
    Ehsan Bojnordi
    Seyed Jalaleddin Mousavirad
    Mahdi Pedram
    Gerald Schaefer
    Diego Oliva
    New Generation Computing, 2023, 41 : 225 - 242
  • [32] How does degree heterogeneity affect nucleation on complex networks?
    Chen, Hanshuang
    Li, Shuxian
    Hou, Zhonghuai
    He, Gang
    Huang, Feng
    Shen, Chuansheng
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2013,
  • [33] Affect Classification in Tweets using Multitask Deep Neural Networks
    Nagar, Seema
    Shankhdhar, Achintya
    Barbhuiya, Ferdous Ahmed
    Dey, Kuntal
    WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 516 - 520
  • [34] Contextual modulation of affect: Comparing humans and deep neural networks
    Shin, Soomin
    Kim, Doo Yon
    Wallraven, Christian
    COMPANION PUBLICATION OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 127 - 133
  • [35] Norm-Based Generalisation Bounds for Deep Multi-Class Convolutional Neural Networks
    Ledent, Antoine
    Mustafa, Waleed
    Lei, Yunwen
    Kloft, Marius
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8279 - 8287
  • [36] Obstetrics How does low Birth Weight affect Health Expenditures?
    Spyra, Anna
    GESUNDHEITSOEKONOMIE UND QUALITAETSMANAGEMENT, 2014, 19 (04):
  • [37] HOW DOES DMPA USE AFFECT POSTPARTUM WEIGHT RETENTION IN ADOLESCENTS?
    Conroy, Erin
    Patchen, Loral
    JOURNAL OF ADOLESCENT HEALTH, 2010, 46 (02) : S55 - S55
  • [38] How does maternal height and different weight factors affect the newborn?
    Paredes Lascano, P.
    Minaca, Calle
    BOLETIN DE PEDIATRIA, 2011, 51 (215): : 53 - 59
  • [39] How does COVID-19 affect people's ability to smell?
    不详
    MEDICAL JOURNAL OF AUSTRALIA, 2022, 216 (07) : 326 - 326
  • [40] How Does Correlation Affect the Capacity of MIMO Systems with Rate Constraints?
    Wang, Hao
    Wang, Peng
    Ping, Li
    Lin, Xiaokang
    GLOBECOM 2009 - 2009 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-8, 2009, : 3811 - +