Information Bottleneck Theory Based Exploration of Cascade Learning

被引:1
|
作者
Du, Xin [1 ]
Farrahi, Katayoun [1 ]
Niranjan, Mahesan [1 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 3AS, Hants, England
基金
英国工程与自然科学研究理事会;
关键词
information bottleneck theory; Cascade Learning; neural networks;
D O I
10.3390/e23101360
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In solving challenging pattern recognition problems, deep neural networks have shown excellent performance by forming powerful mappings between inputs and targets, learning representations (features) and making subsequent predictions. A recent tool to help understand how representations are formed is based on observing the dynamics of learning on an information plane using mutual information, linking the input to the representation (I(X;T)) and the representation to the target (I(T;Y)). In this paper, we use an information theoretical approach to understand how Cascade Learning (CL), a method to train deep neural networks layer-by-layer, learns representations, as CL has shown comparable results while saving computation and memory costs. We observe that performance is not linked to information-compression, which differs from observation on End-to-End (E2E) learning. Additionally, CL can inherit information about targets, and gradually specialise extracted features layer-by-layer. We evaluate this effect by proposing an information transition ratio, I(T;Y)/I(X;T), and show that it can serve as a useful heuristic in setting the depth of a neural network that achieves satisfactory accuracy of classification.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Information Bottleneck Theory on Convolutional Neural Networks
    Junjie Li
    Ding Liu
    Neural Processing Letters, 2021, 53 : 1385 - 1400
  • [22] Dynamic Encoding and Decoding of Information for Split Learning in Mobile-Edge Computing: Leveraging Information Bottleneck Theory
    Alhussein, Omar
    Wei, Moshi
    Akhavain, Arashmid
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4625 - 4631
  • [23] Information Bottleneck Approach to Spatial Attention Learning
    Lai, Qiuxia
    Li, Yu
    Zeng, Ailing
    Liu, Minhao
    Sun, Hanqiu
    Xu, Qiang
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 779 - 785
  • [24] Successive Information Bottleneck and Applications in Deep Learning
    Yousfi, Yassine
    Akyol, Emrah
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1210 - 1213
  • [25] Robust Few-Label Misinformation Detection Based on Information Bottleneck Theory
    Wang, Jihong
    Zhao, Shuqing
    Luo, Minnan
    Liu, Huan
    Zhao, Xiang
    Zheng, Qinghua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (07): : 1629 - 1642
  • [26] Applying the information bottleneck to statistical relational learning
    Fabrizio Riguzzi
    Nicola Di Mauro
    Machine Learning, 2012, 86 : 89 - 114
  • [27] Disentangled Representation Learning with Transmitted Information Bottleneck
    Dang, Zhuohang
    Luo, Minnan
    Jia, Chengyou
    Dai, Guang
    Wang, Jihong
    Chang, Xiaojun
    Wang, Jingdong
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (12) : 13297 - 13310
  • [28] Information Bottleneck in Deep Learning - A Semiotic Approach
    Musat, B.
    Andonie, R.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2022, 17 (01)
  • [29] Graph Structure Learning with Variational Information Bottleneck
    Sun, Qingyun
    Li, Jianxin
    Peng, Hao
    Wu, Jia
    Fu, Xingcheng
    Ji, Cheng
    Yu, Philip S.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4165 - 4174
  • [30] Federated Learning via Disentangled Information Bottleneck
    Uddin, Md Palash
    Xiang, Yong
    Lu, Xuequan
    Yearwood, John
    Gao, Longxiang
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (03) : 1874 - 1889