Feature selection for binary classification based on class labeling, SOM, and hierarchical clustering

被引:0
|
作者
Zhao Zhengtian [1 ,2 ]
Rui Zhiyuan [1 ,2 ,3 ]
Duan Xiaoyan [2 ]
机构
[1] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
[2] Lanzhou Univ Technol, Coll Elect & Informat Engn, Lanzhou, Peoples R China
[3] Lanzhou Univ Technol, Sch Mech & Elect Engn, Lanzhou, Peoples R China
来源
MEASUREMENT & CONTROL | 2023年 / 56卷 / 9-10期
基金
中国国家自然科学基金;
关键词
Feature selection; class labeling; SOM; hierarchical clustering; MUTUAL INFORMATION; GRAPH;
D O I
10.1177/00202940231173748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection plays an important role in algorithms for processing high-dimensional data. Traditional pattern classification and information theory methods are widely applied to feature selection methods. However, traditional pattern classification methods such as Fisher Score, Laplacian Score, and relief use class labels inadequately. Previous information theory based feature selection methods such as MIFS ignore the intra-class to tight inter-class to sparse property of the samples. To address these problems, a feature selection algorithm for the binary classification problem is proposed, which is based on class label transformation using self-organizing mapping neural network (SOM) and cohesive hierarchical clustering. The algorithm first converts class labels without numerical meaning into numerical values that can participate in operations and retain classification information through class label mapping, and constitutes a two-dimensional vector from it and the attribute values to be judged. Then, these two-dimensional vectors are clustered by using SOM neural network and hierarchical clustering. Finally, evaluation function value is calculated, that is closely related to intra-cluster to tightness, inter-cluster separation, and division accuracy after clustering, and is used to evaluate the ability of alternative attributes to distinguish between classes. It is experimentally verified that the algorithm is robust and can effectively screen attributes with strong classification ability and improve the prediction performance of the classifier.
引用
收藏
页码:1649 / 1669
页数:21
相关论文
共 50 条
  • [31] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    [J]. APPLIED SOFT COMPUTING, 2020, 93
  • [32] Fast Simultaneous Clustering and Feature Selection for Binary Data
    Laclau, Charlotte
    Nadif, Mohamed
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIII, 2014, 8819 : 192 - 202
  • [33] Feature subset selection in SOM based text categorization
    Bassiouny, S
    Nagi, M
    Hussein, MF
    [J]. IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 860 - 866
  • [34] Using class-based feature selection for the classification of hyperspectral data
    Maghsoudi, Yasser
    Zoej, Mohammad Javad Valadan
    Collins, Michael
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2011, 32 (15) : 4311 - 4326
  • [35] Clustering-based Binary-class Classification for Imbalanced Data Sets
    Chen, Chao
    Shyu, Mei-Ling
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2011, : 384 - 389
  • [36] HIERARCHICAL POLARIMETRIC SAR IMAGE CLASSIFICATION BASED ON FEATURE SELECTION AND GENETIC ALGORITHM
    Wang, Yunyan
    Zhuo, Tong
    Zhang, Yu
    Liao, Mingsheng
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 764 - 768
  • [37] A DYNAMIC HIERARCHICAL FEATURE SELECTION METHOD FOR OBJECT-BASED CLASSIFICATION OF WETLANDS
    Mahdavi, Sahel
    Salehi, Bahram
    Amani, Meisam
    Granger, Jean
    Brisco, Brian
    Huang, Weimin
    [J]. 2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 570 - 573
  • [38] Label-correlation-based Common and Specific Feature Selection for Hierarchical Classification
    Lin Y.-J.
    Bai S.-X.
    Zhao H.
    Li S.-Z.
    Hu Q.-H.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (07): : 2667 - 2682
  • [39] TESTING FOR TRANSITIVE CLASS CONTAINMENT AS A FEATURE OF HIERARCHICAL CLASSIFICATION
    Slattery, Brian
    Stewart, Ian
    O'Hora, Denis
    [J]. JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 2011, 96 (02) : 243 - 260
  • [40] Fuzzy Rough Sets-Based Incremental Feature Selection for Hierarchical Classification
    Huang, Wanli
    She, Yanhong
    He, Xiaoli
    Ding, Weiping
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (10) : 3721 - 3733