Overfitting measurement of convolutional neural networks using trained network weights

被引:3
|
作者
Watanabe, Satoru [1 ]
Yamana, Hayato [2 ]
机构
[1] Waseda Univ, Dept Comp Sci & Commun Engn, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
[2] Waseda Univ, Fac Sci & Engn, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
关键词
Convolutional neural network; Overfitting; Persistent homology; Topological data analysis; PERSISTENT HOMOLOGY;
D O I
10.1007/s41060-022-00332-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Overfitting reduces the generalizability of convolutional neural networks (CNNs). Overfitting is generally detected by comparing the accuracies and losses of the training and validation data, where the validation data are formed from a portion of the training data; however, detection methods are ineffective for pretrained networks distributed without the training data. Thus, in this paper, we propose a method to detect overfitting of CNNs using the trained network weights inspired by the dropout technique. The dropout technique has been employed to prevent CNNs from overfitting, where the neurons in the CNNs are invalidated randomly during their training. It has been hypothesized that this technique prevents CNNs from overfitting by restraining the co-adaptations among neurons, and this hypothesis implies that the overfitting of CNNs results from co-adaptations among neurons and can be detected by investigating the inner representation of CNNs. The proposed persistent homology-based overfitting measure (PHOM) method constructs clique complexes in CNNs using the trained network weights, and the one-dimensional persistent homology investigates co-adaptations among neurons. In addition, we enhance PHOM to normalized PHOM (NPHOM) to mitigate fluctuation in PHOM caused by the difference in network structures. We applied the proposed methods to convolutional neural networks trained for the classification problems on the CIFAR-10, street view house number, Tiny ImageNet, and CIFAR-100 datasets. Experimental results demonstrate that PHOM and NPHOM can indicate the degree of overfitting of CNNs, which suggests that these methods enable us to filter overfitted CNNs without requiring the training data.
引用
收藏
页码:261 / 278
页数:18
相关论文
共 50 条
  • [21] Development of a deep learning network using a pre-trained convolutional neural network
    Rooney, M.
    Mitchell, J.
    McLaren, D. B.
    Nailon, W. H.
    RADIOTHERAPY AND ONCOLOGY, 2019, 133 : S1051 - S1052
  • [22] BROADBAND DOA ESTIMATION USING CONVOLUTIONAL NEURAL NETWORKS TRAINED WITH NOISE SIGNALS
    Chakrabarty, Soumitro
    Habets, Emanuel A. P.
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 136 - 140
  • [23] Fast and convergent iterative image recovery using trained convolutional neural networks
    Chun, Il Yong
    Lim, Hongki
    Huang, Zhengyu
    Fessler, Jeffrey A.
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 155 - 159
  • [24] Classification of Deepfake Videos Using Pre-trained Convolutional Neural Networks
    Masood, MomMa
    Nawaz, Marriam
    Javed, Ali
    Nazir, Tahira
    Mehmood, Awais
    Mahum, Rabbia
    2021 INTERNATIONAL CONFERENCE ON DIGITAL FUTURES AND TRANSFORMATIVE TECHNOLOGIES (ICODT2), 2021,
  • [25] Automatic variogram inference using pre-trained Convolutional Neural Networks
    Karim, Mokdad
    Behrang, Koushavand
    Jeff, Boisvert
    APPLIED COMPUTING AND GEOSCIENCES, 2025, 25
  • [26] Impact of random weights on nonlinear system identification using convolutional neural networks
    Yu, Wen
    Pacheco, Mario
    INFORMATION SCIENCES, 2019, 477 : 1 - 14
  • [27] Drone Model Classification Using Convolutional Neural Network Trained on Synthetic Data
    Wisniewski, Mariusz
    Rana, Zeeshan A.
    Petrunin, Ivan
    JOURNAL OF IMAGING, 2022, 8 (08)
  • [28] Fish species identification using a convolutional neural network trained on synthetic data
    Allken, Vaneeda
    Handegard, Nils Olav
    Rosen, Shale
    Schreyeck, Tiffanie
    Mahiout, Thomas
    Malde, Ketil
    ICES JOURNAL OF MARINE SCIENCE, 2019, 76 (01) : 342 - 349
  • [29] Underwater sonar image classification using adaptive weights convolutional neural network
    Wang, Xingmei
    Jiao, Jia
    Yin, Jingwei
    Zhao, Wensheng
    Han, Xiao
    Sun, Boxuan
    APPLIED ACOUSTICS, 2019, 146 : 145 - 154
  • [30] Overfitting One-Dimensional convolutional neural networks for Raman spectra identification
    Mozaffari, M. Hamed
    Tay, Li-Lin
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 272