Ensemble feature selection for single-label text classification: a comprehensive analytical study

被引:2
|
作者
Parlak, Bekir [1 ]
机构
[1] Amasya Univ, Dept Comp Engn, Amasya, Turkiye
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 26期
关键词
Text classification; Feature selection; Global; Local; Ensemble feature subsets; SCHEME;
D O I
10.1007/s00521-023-08763-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the large amount of textual data, text classification is a crucial problem in the modern era. In text classification studies, feature selection is one of the most crucial processes because it has a big impact on classification accuracy. Many feature selection techniques are suggested in the field of text classification in the literature. Each method sorts the features by assigning a score according to its algorithm. Then, the classification process is performed by selecting top-N features. However, the feature order for each method is different from each other. Each method selects by assigning a high score to the features that are important according to its algorithm, while it does not select by assigning a low score to the insignificant features. However, each method selects different distinguishing features according to its algorithm. With combinations of these distinguishing features, a higher performance classification process can be achieved. So, the classification process is to combine the features in a different order according to each method in this study. Thus, it will be observed which methods are successful or unsuccessful when combined. In addition, it was observed that the methods chose how many different features from each other. Accordingly, the classification is made by combining the features of different sizes and combining two local and two global feature selection methods. Numerous studies using three benchmark datasets have shown that the combination of feature selection approaches performs better than any single feature selection method used alone. However, some combinations have lower performance rates than individual methods. Thus, a comprehensive study was carried out in text classification domain.
引用
收藏
页码:19235 / 19251
页数:17
相关论文
共 50 条
  • [1] Ensemble feature selection for single-label text classification: a comprehensive analytical study
    Bekir Parlak
    [J]. Neural Computing and Applications, 2023, 35 : 19235 - 19251
  • [2] Document-base extraction for single-label Text Classification
    Wang, Yanbo J.
    Sanderson, Robert
    Coenen, Frans
    Leng, Paul
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 357 - +
  • [3] An Ensemble Embedded Feature Selection Method for Multi-Label Clinical Text Classification
    Guo, Yumeng
    Chung, Fulai
    Li, Guozheng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 823 - 826
  • [4] Ensemble feature selection for multi-label text classification: An intelligent order statistics approach
    Miri, Mohsen
    Dowlatshahi, Mohammad Bagher
    Hashemi, Amin
    Rafsanjani, Marjan Kuchaki
    Gupta, Brij B.
    Alhalabi, W.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 11319 - 11341
  • [5] Multi-label text classification with an ensemble feature space
    Department of Mathematics, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
    [J]. J. Intelligent Fuzzy Syst., 2022, 5 (4425-4436):
  • [6] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4425 - 4436
  • [7] A Comprehensive Study of Eleven Feature Selection Algorithms and their Impact on Text Classification
    Vora, Suchi
    Yang, Hui
    [J]. 2017 COMPUTING CONFERENCE, 2017, : 440 - 449
  • [8] Ensemble Learning Based Feature Selection with an Application to Text Classification
    Onan, Aytug
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [9] ROBUST SINGLE-LABEL CLASSIFICATION OF FACIAL ATTRIBUTES
    Mohammed, Ahmed Abdulateef
    Sajjanhar, Atul
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [10] Ensemble Filter-Wrapper Text Feature Selection Methods for Text Classification
    Ige, Oluwaseun Peter
    Gan, Keng Hoon
    [J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024,