CREATING ARTIFICIAL NEURAL NETWORKS THAT GENERALIZE

被引:346
|
作者
SIETSMA, J
DOW, RJF
机构
关键词
NEURAL NETWORKS; BACK-PROPAGATION; PATTERN RECOGNITION; GENERALIZATION; HIDDEN UNITS; PRUNING;
D O I
10.1016/0893-6080(91)90033-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a technique to test the hypothesis that multilayered, feed-forward networks with few units on the first hidden layer generalize better than networks with many units in the first layer. Large networks are trained to perform a classification task and the redundant units are removed ("pruning") to produce the smallest network capable of performing the task. A technique for inserting layers where pruning has introduced linear inseparability is also described. Two tests of ability to generalize are used - the ability to classify training inputs corrupted by noise and the ability to classify new patterns from each class. The hypothesis is found to be false for networks trained with noisy inputs. Pruning to the minimum number of units in the first layer produces networks which correctly classify the training set but generalize poorly compared with larger networks.
引用
收藏
页码:67 / 79
页数:13
相关论文
共 50 条
  • [1] ON THE ABILITY OF NEURAL NETWORKS TO GENERALIZE BY INDUCTION
    ANSHELEVICH, VV
    AMIRIKYAN, BR
    LUKASHIN, AV
    FRANKKAMENETSKII, MD
    BIOFIZIKA, 1989, 34 (03): : 491 - 495
  • [2] Creating artificial human genomes using generative neural networks
    Yelmen, B.
    Decelle, A.
    Ongaro, L.
    Marnetto, D.
    Tallec, C.
    Montinaro, F.
    Furtlehner, C.
    Pagani, L.
    Jay, F.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 591 - 592
  • [3] Creating artificial human genomes using generative neural networks
    Yelmen, Burak
    Decelle, Aurelien
    Ongaro, Linda
    Marnetto, Davide
    Tallec, Corentin
    Montinaro, Francesco
    Furtlehner, Cyril
    Pagani, Luca
    Jay, Flora
    PLOS GENETICS, 2021, 17 (02):
  • [4] Modern Neural Networks Generalize on Small Data Sets
    Olson, Matthew
    Wyner, Abraham J.
    Berk, Richard
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [5] Task Discovery: Finding the Tasks that Neural Networks Generalize on
    Atanov, Andrei
    Filatov, Andrei
    Yeo, Teresa
    Sohmshetty, Ajay
    Zamir, Amir
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Generalize Deep Neural Networks With Adaptive Regularization for Classifying
    Guo, Kehua
    Tao, Ze
    Zhang, Lingyan
    Hu, Bin
    Kui, Xiaoyan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1216 - 1229
  • [7] THE PROBLEMS OF CREATING EXPERT SYSTEMS USING ARTIFICIAL NEURAL NETWORKS AND THEIR USE IN MEDICINE
    Lin, Dmitry, I
    Burnashev, Rustam A.
    Enikeev, Arslan, I
    IIOAB JOURNAL, 2018, 9 (02) : 136 - 142
  • [8] When Neural Networks Fail to Generalize? A Model Sensitivity Perspective
    Zhang, Jiajin
    Chao, Hanqing
    Dhurandhar, Amit
    Chen, Pin-Yu
    Tajer, Ali
    Xu, Yangyang
    Yan, Pingkun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11219 - 11227
  • [9] Artificial life based on boids model and evolutionary chaotic neural networks for creating artworks
    Choi, Tae Jong
    Ahn, Chang Wook
    SWARM AND EVOLUTIONARY COMPUTATION, 2019, 47 : 80 - 88
  • [10] Artificial neural networks
    Piuri, V
    Alippi, C
    JOURNAL OF SYSTEMS ARCHITECTURE, 1998, 44 (08) : 565 - 567