Polyhedron Attention Module: Learning Adaptive-order Interactions

被引:0
|
作者
Zhu, Tan [1 ]
Dou, Fei [2 ]
Wang, Xinyu [1 ]
Lu, Jin [2 ]
Bi, Jinbo [1 ]
机构
[1] Univ Connecticut, Mansfield, CT USA
[2] Univ Georgia, Athens, GA USA
关键词
AGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning feature interactions can be the key for multivariate predictive modeling. ReLU-activated neural networks create piecewise linear prediction models. Other nonlinear activation functions lead to models with only high-order feature interactions, thus lacking of interpretability. Recent methods incorporate candidate polynomial terms of fixed orders into deep learning, which is subject to the issue of combinatorial explosion, or learn the orders that are difficult to adapt to different regions of the feature space. We propose a Polyhedron Attention Module (PAM) to create piecewise polynomial models where the input space is split into polyhedrons which define the different pieces and on each piece the hyperplanes that define the polyhedron boundary multiply to form the interactive terms, resulting in interactions of adaptive order to each piece. PAM is interpretable to identify important interactions in predicting a target. Theoretic analysis shows that PAM has stronger expression capability than ReLU-activated networks. Extensive experimental results demonstrate the superior classification performance of PAM on massive datasets of the click-through rate prediction and PAM can learn meaningful interaction effects in a medical problem.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Adaptive Factorization Network: Learning Adaptive-Order Feature Interactions
    Cheng, Weiyu
    Shen, Yanyan
    Huang, Linpeng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3609 - 3616
  • [2] Adaptive-order proximity learning for graph-based clustering
    Wu, Danyang
    Chang, Wei
    Lu, Jitao
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    [J]. PATTERN RECOGNITION, 2022, 126
  • [3] Adaptive-order Polynomial Methods for Power Amplifier Model Estimation
    Dvorak, Jiri
    Marsalek, Roman
    Blumenstein, Jiri
    [J]. 2013 23RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2013, : 389 - 392
  • [4] Adaptive-Order Fractional Fourier Transform Features for Speech Recognition
    Yin Hui
    Xie Xiang
    Kuang Jingming
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 654 - 657
  • [5] A heuristic adaptive-order intuitionistic fuzzy time series forecasting model
    Wang, Yanan
    Lei, Yingjie
    Wang, Yi
    Zheng, Kouquan
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2016, 38 (11): : 2795 - 2802
  • [6] Adaptive-order rational Arnoldi-type methods in computational electromagnetism
    André Bodendiek
    Matthias Bollhöfer
    [J]. BIT Numerical Mathematics, 2014, 54 : 357 - 380
  • [7] Adaptive-order rational Arnoldi-type methods in computational electromagnetism
    Bodendiek, Andre
    Bollhoefer, Matthias
    [J]. BIT NUMERICAL MATHEMATICS, 2014, 54 (02) : 357 - 380
  • [8] SAR minimum-entropy autofocus using an adaptive-order polynomial model
    Wang, Junfeng
    Liu, Xingzhao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2006, 3 (04) : 512 - 516
  • [9] Adaptive-Order Regression-Based MR Image Super-Resolution
    Hu, Jing
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SENSING AND IMAGING, 2019, 506 : 261 - 268
  • [10] An adaptive-order discontinuous Galerkin method for the solution of the Euler equations of gas dynamics
    Baumann, CE
    Oden, JT
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2000, 47 (1-3) : 61 - 73