A novel scheme to determine the architecture of a Multi-layer Perceptron

被引:0
|
作者
Chintalapudi, KK [1 ]
Pal, NR [1 ]
机构
[1] Drexel Univ, Dept Elect & Comp Engn, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a method for optimizing the architecture, of a Multilayer Perceptron (MLP) network. The proposed scheme is a variation of the MLP architecture, in which each neuron's output is modulated by an efficiency factor associated with that node. Nodes with low efficiencies (efficiency factors) literally do not participate in the network. We compute the efficiency of a node using a multiplier function with a learnable parameter which we call as the multiplier of that node. Values of the multipliers are learned by a gradient descent along with the weights, aiming to minimize the mean square error (MSE). Training starts with all node efficiencies set very low so that there is literally no connection between any of the neurons in the net. As the learning progresses, gradually some of the nodes start acquiring high efficiencies. Training is terminated when performance of the network is satisfactory. At the end of the training, nodes with low efficiency are eliminated and a near optimal architectural size fdr the MLP is obtained. Effectiveness of the proposed scheme is demonstrated on several data-sets.
引用
收藏
页码:2297 / 2302
页数:6
相关论文
共 50 条
  • [1] Multi-layer perceptron mapping on a SIMD architecture
    Vitabile, S
    Gentile, A
    Dammone, GB
    Sorbello, F
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 667 - 675
  • [2] An efficient implementation of multi-layer perceptron on mesh architecture
    Ayoubi, RA
    Bayoumi, MA
    [J]. 2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 109 - 112
  • [3] A Novel Network Delay Prediction Model with Mixed Multi-layer Perceptron Architecture for Edge Computing
    Fang, Honglin
    Yu, Peng
    Wang, Ying
    Li, Wenjing
    Zhou, Fanqin
    Ma, Run
    [J]. 2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 191 - 197
  • [4] Multi-layer Perceptron Architecture for Kinect-Based Gait Recognition
    Bari, A. S. M. Hossain
    Gavrilova, Marina L.
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2019, 2019, 11542 : 356 - 363
  • [5] Graph Attention Multi-Layer Perceptron
    Zhang, Wentao
    Yin, Ziqi
    Sheng, Zeang
    Li, Yang
    Ouyang, Wen
    Li, Xiaosen
    Tao, Yangyu
    Yang, Zhi
    Cui, Bin
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4560 - 4570
  • [6] Symbolic representation of a multi-layer perceptron
    Mouria-Beji, F
    [J]. ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, 2001, : 205 - 208
  • [7] Multi-Layer Perceptron Classifier and Paillier Encryption Scheme for Friend Recommendation System
    Patil, Kaustubh
    Jadhav, Nagesh
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [8] A novel variable selection algorithm for multi-layer perceptron with elastic net
    Zhang, Fangfang
    Sun, Kai
    Wu, Xiuliang
    [J]. NEUROCOMPUTING, 2019, 361 : 110 - 118
  • [9] Tighter Guarantees for the Compressive Multi-layer Perceptron
    Kaban, Ata
    Thummanusarn, Yamonporn
    [J]. THEORY AND PRACTICE OF NATURAL COMPUTING (TPNC 2018), 2018, 11324 : 388 - 400
  • [10] Multi-Layer Perceptron with Pulse Glial Chain
    Ikuta, Chihiro
    Uwate, Yoko
    Nishio, Yoshifumi
    Yang, Guoan
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (03): : 742 - 755