Design of feature selection algorithm for high-dimensional network data based on supervised discriminant projection

被引:1
|
作者
Zhang, Zongfu [1 ]
Luo, Qingjia [2 ]
Ying, Zuobin [2 ]
Chen, Rongbin [1 ]
Chen, Hongan [1 ]
机构
[1] Jiangmen Polytech, Coll Informat Engn, Jiangmen, Peoples R China
[2] City Univ Macau, Fac Data Sci, Macau, Peoples R China
关键词
Supervised discriminant projection; Network high-dimensional data; Feature selection; Sparse subspace clustering; Sparse constraint; SYSTEM;
D O I
10.7717/peerj-cs.1447
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High dimension and complexity of network high-dimensional data lead to poor feature selection effect network high-dimensional data. To effectively solve this problem, feature selection algorithms for high-dimensional network data based on supervised discriminant projection (SDP) have been designed. The sparse representation problem of high-dimensional network data is transformed into an Lp norm optimization problem, and the sparse subspace clustering method is used to cluster high-dimensional network data. Dimensionless processing is carried out for the clustering processing results. Based on the linear projection matrix and the best transformation matrix, the dimensionless processing results are reduced by combining the SDP. The sparse constraint method is used to achieve feature selection of high-dimensional data in the network, and the relevant feature selection results are obtained. The experimental findings demonstrate that the suggested algorithm can effectively cluster seven different types of data and converges when the number of iterations approaches 24. The F1 value, recall, and precision are all kept at high levels. High-dimensional network data feature selection accuracy on average is 96.9%, and feature selection time on average is 65.1 milliseconds. The selection effect for network high-dimensional data features is good.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Design of feature selection algorithm for high-dimensional network data based on supervised discriminant projection
    Zhang Z.
    Luo Q.
    Ying Z.
    Chen R.
    Chen H.
    [J]. PeerJ Computer Science, 2023, 9
  • [2] Diagonal Discriminant Analysis With Feature Selection for High-Dimensional Data
    Romanes, Sarah E.
    Ormerod, John T.
    Yang, Jean Y. H.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 114 - 127
  • [3] Reduction algorithm based on supervised discriminant projection for network security data
    Guo F.
    Lyu H.
    Ren W.
    Wang R.
    [J]. Tongxin Xuebao/Journal on Communications, 2021, 42 (06): : 84 - 93
  • [4] Optimal Feature Selection in High-Dimensional Discriminant Analysis
    Kolar, Mladen
    Liu, Han
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) : 1063 - 1083
  • [5] A density-based clustering algorithm for high-dimensional data with feature selection
    Qi Xianting
    Wang Pan
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 114 - 118
  • [6] A differential evolution based feature combination selection algorithm for high-dimensional data
    Guan, Boxin
    Zhao, Yuhai
    Yin, Ying
    Li, Yuan
    [J]. INFORMATION SCIENCES, 2021, 547 : 870 - 886
  • [7] FsNet: Feature Selection Network on High-dimensional Biological Data
    Singh, Dinesh
    Climente-Gonzalez, Hector
    Petrovich, Mathis
    Kawakami, Eiryo
    Yamada, Makoto
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [9] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [10] Feature selection algorithm based on optimized genetic algorithm and the application in high-dimensional data processing
    Feng, Guilian
    [J]. PLOS ONE, 2024, 19 (05):