Toward Deep Adaptive Hinging Hyperplanes

被引:0
|
作者
Tao, Qinghua [1 ,2 ]
Xu, Jun [3 ]
Li, Zhen [3 ]
Xie, Na [4 ]
Wang, Shuning [2 ]
Li, Xiaoli [5 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, STADIUS, ESAT, B-3001 Leuven, Belgium
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[3] Harbin Inst Technol, Sch Mech Engn & Automat, Shenzhen 518055, Peoples R China
[4] Cent Univ Finance & Econ, Sch Management Sci & Engn, Beijing 100081, Peoples R China
[5] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 欧洲研究理事会;
关键词
Neurons; Artificial neural networks; Network topology; Training; Topology; Optimization; Adaptive systems; Adaptive hinging hyperplanes (AHHs); analysis of variance (ANOVA) decomposition; domain partition; piecewise linear (PWL); skip-layer connection; REGRESSION; SELECTION;
D O I
10.1109/TNNLS.2021.3079113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The adaptive hinging hyperplane (AHH) model is a popular piecewise linear representation with a generalized tree structure and has been successfully applied in dynamic system identification. In this article, we aim to construct the deep AHH (DAHH) model to extend and generalize the networking of AHH model for high-dimensional problems. The network structure of DAHH is determined through a forward growth, in which the activity ratio is introduced to select effective neurons and no connecting weights are involved between the layers. Then, all neurons in the DAHH network can be flexibly connected to the output in a skip-layer format, and only the corresponding weights are the parameters to optimize. With such a network framework, the backpropagation algorithm can be implemented in DAHH to efficiently tackle large-scale problems and the gradient vanishing problem is not encountered in the training of DAHH. In fact, the optimization problem of DAHH can maintain convexity with convex loss in the output layer, which brings natural advantages in optimization. Different from the existing neural networks, DAHH is easier to interpret, where neurons are connected sparsely and analysis of variance (ANOVA) decomposition can be applied, facilitating to revealing the interactions between variables. A theoretical analysis toward universal approximation ability and explicit domain partitions are also derived. Numerical experiments verify the effectiveness of the proposed DAHH.
引用
收藏
页码:6373 / 6387
页数:15
相关论文
共 50 条
  • [1] Fast Adaptive Hinging Hyperplanes
    Tao, Qinghua
    Xu, Jun
    Suykens, Johan A. K.
    Wang, Shuning
    [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 1482 - 1487
  • [2] Optimization based on adaptive hinging hyperplanes and genetic algorithm
    Xu, Jun
    Xi, Xiangming
    Wang, Shuning
    [J]. 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 2040 - 2046
  • [3] Generalization of hinging hyperplanes
    Wang, SN
    Sun, XS
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (12) : 4425 - 4431
  • [4] Adaptive hinging hyperplanes and its applications in dynamic system identification
    Xu, Jun
    Huang, Xiaolin
    Wang, Shuning
    [J]. AUTOMATICA, 2009, 45 (10) : 2325 - 2332
  • [5] Model predictive control based on adaptive hinging hyperplanes model
    Xu, Jun
    Huang, Xiaolin
    Mu, Xiaomu
    Wang, Shuning
    [J]. JOURNAL OF PROCESS CONTROL, 2012, 22 (10) : 1821 - 1831
  • [6] Urban Traffic Flow Forecasting Based on Adaptive Hinging Hyperplanes
    Lu, Yang
    Hu, Jianming
    Xu, Jun
    Wang, Shuning
    [J]. ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PROCEEDINGS, 2009, 5855 : 658 - 667
  • [7] An Improved Coordinate Update Method for the Identification of Adaptive Hinging Hyperplanes Model
    Tao, Qinghua
    Xu, Jun
    Wang, Shuning
    Li, Li
    [J]. PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2018), 2018, : 102 - 106
  • [8] Nonlinear Model Predictive Control Using Adaptive Hinging Hyperplanes Model
    Xu, Jun
    Huang, Xiaolin
    Wang, Shuning
    [J]. PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 2598 - 2603
  • [9] Hinging Hyperplanes for Time-Series Segmentation
    Huang, Xiaolin
    Matijas, Marin
    Suykens, Johan A. K.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (08) : 1279 - 1291
  • [10] On the hinge-finding algorithm for hinging hyperplanes
    Pucar, P
    Sjoberg, J
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (03) : 1310 - 1319