Mutual Information Dropout: Mutual Information Can Be All You Need

被引:0
|
作者
Song, Zichen [1 ]
Ma, Shan [1 ]
机构
[1] Lanzhou Univ, Lanzhou 730000, Peoples R China
关键词
Dropout; Mutual Information; Generalization Ability;
D O I
10.1007/978-3-031-44201-8_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dropout is a powerful way for preventing model overfitting. However, it is inefficient due to it randomly ignoring some neurons. Although there are many ways on Dropout, they are still either inefficient on improving generalization ability or not effective enough. In this paper, we propose Mutual Information Dropout, which is an efficient Dropout based on dropping neurons with low mutual information. In Mutual Information Dropout, instead of randomly ignoring some neurons, we first evaluated the mutual information of neurons to dropout with mutual information below a certain threshold. In this way, Mutual Information Dropout can achieve effective improving generalization ability with evaluate neurons. Extensive experiments on Three datasets show that Mutual Information Dropout is much more efficient than many existing Dropout and can meanwhile achieve comparable or even better generalization ability.
引用
收藏
页码:91 / 101
页数:11
相关论文
共 50 条
  • [1] α-Mutual Information
    Verdu, Sergio
    [J]. 2015 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2015, : 1 - 6
  • [2] The feeling is mutual - Mutual funds information sources
    Hartmann, J
    [J]. ECONTENT, 2000, 23 (03) : 56 - +
  • [3] Hashing with Mutual Information
    Cakir, Fatih
    He, Kun
    Bargal, Sarah Adel
    Sclaroff, Stan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (10) : 2424 - 2437
  • [4] Distribution of mutual information
    Hutter, M
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 399 - 406
  • [5] Fisher Information and Mutual Information Constraints
    Barnes, Leighton Pate
    Ozgur, Ayfer
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 2179 - 2184
  • [6] The kernel mutual information
    Gretton, A
    Herbrich, R
    Smola, AJ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 880 - 883
  • [7] On Causality and Mutual Information
    Solo, Victor
    [J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 4939 - 4944
  • [8] The mutual information constellation
    Nilsson, A
    Aulin, TM
    [J]. PROCEEDINGS OF THE IEEE ITSOC INFORMATION THEORY WORKSHOP 2005 ON CODING AND COMPLEXITY, 2005, : 152 - 156
  • [9] ON CALCULATION OF MUTUAL INFORMATION
    DUNCAN, TE
    [J]. SIAM JOURNAL ON APPLIED MATHEMATICS, 1970, 19 (01) : 215 - &
  • [10] Distribution of mutual information
    Abarbanel, HDI
    Masuda, N
    Rabinovich, MI
    Tumer, E
    [J]. PHYSICS LETTERS A, 2001, 281 (5-6) : 368 - 373