Feature selection for binary classification based on class labeling, SOM, and hierarchical clustering

被引:0
|
作者
Zhao Zhengtian [1 ,2 ]
Rui Zhiyuan [1 ,2 ,3 ]
Duan Xiaoyan [2 ]
机构
[1] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
[2] Lanzhou Univ Technol, Coll Elect & Informat Engn, Lanzhou, Peoples R China
[3] Lanzhou Univ Technol, Sch Mech & Elect Engn, Lanzhou, Peoples R China
来源
MEASUREMENT & CONTROL | 2023年 / 56卷 / 9-10期
基金
中国国家自然科学基金;
关键词
Feature selection; class labeling; SOM; hierarchical clustering; MUTUAL INFORMATION; GRAPH;
D O I
10.1177/00202940231173748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection plays an important role in algorithms for processing high-dimensional data. Traditional pattern classification and information theory methods are widely applied to feature selection methods. However, traditional pattern classification methods such as Fisher Score, Laplacian Score, and relief use class labels inadequately. Previous information theory based feature selection methods such as MIFS ignore the intra-class to tight inter-class to sparse property of the samples. To address these problems, a feature selection algorithm for the binary classification problem is proposed, which is based on class label transformation using self-organizing mapping neural network (SOM) and cohesive hierarchical clustering. The algorithm first converts class labels without numerical meaning into numerical values that can participate in operations and retain classification information through class label mapping, and constitutes a two-dimensional vector from it and the attribute values to be judged. Then, these two-dimensional vectors are clustered by using SOM neural network and hierarchical clustering. Finally, evaluation function value is calculated, that is closely related to intra-cluster to tightness, inter-cluster separation, and division accuracy after clustering, and is used to evaluate the ability of alternative attributes to distinguish between classes. It is experimentally verified that the algorithm is robust and can effectively screen attributes with strong classification ability and improve the prediction performance of the classifier.
引用
收藏
页码:1649 / 1669
页数:21
相关论文
共 50 条
  • [1] Feature selection for hierarchical clustering
    Questier, F
    Walczak, B
    Massart, DL
    Boucon, C
    de Jong, S
    [J]. ANALYTICA CHIMICA ACTA, 2002, 466 (02) : 311 - 324
  • [2] FSSOM: One novel SOM clustering algorithm based on feature selection
    Liu, Ming
    Liu, Yuan-Chao
    Wang, Xiao-Long
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 429 - 435
  • [3] Hierarchical Agglomerative Clustering Based Combined Feature Selection and Multi-Class Support Vector Machine for Brain Tumour Classification
    Maya, U. C.
    Meenakshy, K.
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2017, 7 (08) : 1714 - 1722
  • [4] A Feature Selection Methods Based on Concept Extraction and SOM Text Clustering Analysis
    Wang, Lin
    Jiang, Minghu
    Liao, Shasha
    Lu, Yinghua
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (1A): : 20 - 28
  • [5] Feature selection and classification of gear faults using SOM
    Liao, GL
    Shi, TL
    Li, WH
    Huang, T
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 556 - 560
  • [6] A Binary Approximate Naive Bayesian Classification Algorithm Based on SOM Neural Network Clustering
    Shen Honghong
    He Lili
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 1344 - 1347
  • [7] Hierarchical Classification and Regression with Feature Selection
    Ke, Shih-Wen
    Yeh, Chi-Wei
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 1150 - 1154
  • [8] A Recursive Regularization Based Feature Selection Framework for Hierarchical Classification
    Zhao, Hong
    Hu, Qinghua
    Zhu, Pengfei
    Wang, Yu
    Wang, Ping
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (07) : 2833 - 2846
  • [9] Dynamic feature selection in incremental hierarchical clustering
    Talavera, L
    [J]. MACHINE LEARNING: ECML 2000, 2000, 1810 : 392 - 403
  • [10] Feature selection as retrospective pruning in hierarchical clustering
    Talavera, L
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS, PROCEEDINGS, 1999, 1642 : 75 - 86