Nonlinear Feature Selection Neural Network via Structured Sparse Regularization

Cited by: 2
Authors
Wang, Rong [1 ,2 ]
Bian, Jintang [1 ,2 ,3 ]
Nie, Feiping [1 ,2 ,3 ]
Li, Xuelong [1 ,2 ]
Affiliations
[1] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Intelligent Interact & Applicat, Minist Ind & Informat Technol, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Classification; neural network; nonlinear feature selection; structured sparsity regularization; supervised learning; REPRESENTATION; REGRESSION;
DOI
10.1109/TNNLS.2022.3209716
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection is an important and effective data preprocessing method, which can remove noisy and redundant features while retaining the relevant and discriminative features in high-dimensional data. In real-world applications, the relationships between data samples and their labels are usually nonlinear. However, most existing feature selection models focus on learning a linear transformation matrix, which cannot capture such nonlinear structure in practice and degrades the performance of downstream tasks. To address this issue, we propose a novel nonlinear feature selection method to select the most relevant and discriminative features in high-dimensional datasets. Specifically, our method learns the nonlinear structure of high-dimensional data with a neural network trained with a cross-entropy loss function, and then uses a structured sparsity norm such as the ℓ2,p-norm to regularize the weight matrix connecting the input layer and the first hidden layer of the neural network, thereby learning the weight of each feature. A structurally sparse weight matrix is thus obtained by conducting nonlinear learning based on a neural network with structured sparsity regularization. We then use the gradient descent method to optimize the proposed model. Experimental results on several synthetic and real-world datasets show the effectiveness and superiority of the proposed nonlinear feature selection model.
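The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not state the architecture, activation, value of p, or hyperparameters, so the single tanh hidden layer, the names `l2p_penalty`, `l2p_grad`, `rank_features`, and `train`, the smoothing constant `eps`, and all default values are assumptions chosen for illustration. The sketch trains a small network by full-batch gradient descent on cross-entropy plus an ℓ2,p penalty on the rows of the first-layer weight matrix, then ranks input features by row norm.

```python
import numpy as np

def l2p_penalty(W, p=1.0, eps=1e-8):
    """Sum over rows of W (one row per input feature) of the row l2 norm to the power p."""
    row_norms = np.sqrt(np.sum(W ** 2, axis=1) + eps)
    return np.sum(row_norms ** p)

def l2p_grad(W, p=1.0, eps=1e-8):
    """Gradient of l2p_penalty w.r.t. W; eps (an assumption) smooths the norm near zero."""
    row_norms = np.sqrt(np.sum(W ** 2, axis=1, keepdims=True) + eps)
    return p * row_norms ** (p - 2) * W

def rank_features(W):
    """Feature indices ordered by decreasing l2 norm of their first-layer weight row."""
    return np.argsort(-np.linalg.norm(W, axis=1))

def train(X, y, n_hidden=8, lam=0.05, p=1.0, lr=0.1, n_steps=500, seed=0):
    """Full-batch gradient descent on cross-entropy + lam * l2p_penalty(W1).

    X: (n, d) float array; y: (n,) integer class labels starting at 0.
    Returns the learned first-layer weight matrix W1 of shape (d, n_hidden).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    k = int(y.max()) + 1
    W1 = rng.normal(scale=0.1, size=(d, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(scale=0.1, size=(n_hidden, k))
    b2 = np.zeros(k)
    Y = np.eye(k)[y]  # one-hot labels
    for _ in range(n_steps):
        # Forward pass: tanh hidden layer, softmax output.
        H = np.tanh(X @ W1 + b1)
        logits = H @ W2 + b2
        logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
        P = np.exp(logits)
        P = P / P.sum(axis=1, keepdims=True)
        # Backward pass for the mean cross-entropy loss.
        dlogits = (P - Y) / n
        dW2 = H.T @ dlogits
        db2 = dlogits.sum(axis=0)
        dZ = (dlogits @ W2.T) * (1.0 - H ** 2)
        # Only the input-to-hidden weights receive the structured sparsity term.
        dW1 = X.T @ dZ + lam * l2p_grad(W1, p)
        db1 = dZ.sum(axis=0)
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2
    return W1

if __name__ == "__main__":
    # Synthetic data (hypothetical): the label depends nonlinearly on features 0 and 1 only.
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 6))
    y = (X[:, 0] * X[:, 1] > 0).astype(int)
    print(rank_features(train(X, y)))
```

Because the penalty groups each input feature's outgoing weights into one row, shrinking a row toward zero effectively removes that feature from the network, which is why the row norms can serve as feature scores.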
Pages: 9493-9505
Number of pages: 13