Filter feature selection methods for text classification: a review

被引:1
|
作者
Ming, Hong [1 ]
Heyong, Wang [1 ]
机构
[1] South China Univ Technol, Dept Elect Business, Guangzhou 510006, Peoples R China
关键词
Text classification; Filter feature selection; Review; UNSUPERVISED FEATURE-SELECTION; AUTOMATIC CLASSIFICATION; RECOMMENDATION SYSTEM; DIMENSION REDUCTION; DOCUMENT FREQUENCY; ALGORITHM; SCHEME; CATEGORIZATION; EXTRACTION; FRAMEWORK;
D O I
10.1007/s11042-023-15675-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Filter feature selection methods are utilized to select discriminative terms from high-dimensional text data to improve text classification performance and reduce computational costs. This paper aims to provide a comprehensive systematic review of existing filter feature selection methods for text classification. Firstly, we briefly discuss text classification based on filter feature selection. Secondly, we present a detailed discussion on mathematical designs, effectiveness and complexity of existing filter feature selection methods of different methodologies (supervised methods, unsupervised methods and hybrid methods). In addition, a certain number of benchmark datasets for evaluating performance of filter feature selection methods in text classification are also discussion. Finally, we provide future directions in filter feature selection, along with conclusion.
引用
收藏
页码:2053 / 2091
页数:39
相关论文
共 50 条
  • [1] Filter feature selection methods for text classification: a review
    Hong Ming
    Wang Heyong
    [J]. Multimedia Tools and Applications, 2024, 83 : 2053 - 2091
  • [2] Ensemble Filter-Wrapper Text Feature Selection Methods for Text Classification
    Ige, Oluwaseun Peter
    Gan, Keng Hoon
    [J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, : 1847 - 1865
  • [3] Feature selection methods for text classification: a systematic literature review
    Pintas, Julliano Trindade
    Fernandes, Leandro A. F.
    Garcia, Ana Cristina Bicharra
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) : 6149 - 6200
  • [4] Arabic Text Classification: A Review Study on Feature Selection Methods
    Hijazi, Musab Mustafa
    Zeki, Akram
    Ismail, Amelia
    [J]. 2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 554 - 559
  • [5] Feature selection methods for text classification: a systematic literature review
    Julliano Trindade Pintas
    Leandro A. F. Fernandes
    Ana Cristina Bicharra Garcia
    [J]. Artificial Intelligence Review, 2021, 54 : 6149 - 6200
  • [6] Feature Selection Methods for Text Classification
    Dasgupta, Anirban
    Drineas, Petros
    Harb, Boulos
    Josifovski, Vanja
    Mahoney, Michael W.
    [J]. KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 230 - +
  • [7] A New Performance Metric to Evaluate Filter Feature Selection Methods in Text Classification
    Cekik, Rasim
    Kaya, Mahmut
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2024, 30 (07) : 978 - 1005
  • [8] Feature selection for text classification: A review
    Deng, Xuelian
    Li, Yuqing
    Weng, Jian
    Zhang, Jilian
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) : 3797 - 3816
  • [9] Feature selection for text classification: A review
    Xuelian Deng
    Yuqing Li
    Jian Weng
    Jilian Zhang
    [J]. Multimedia Tools and Applications, 2019, 78 : 3797 - 3816
  • [10] Comparison on Feature Selection Methods for Text Classification
    Liu, Wenkai
    Xiao, Jiongen
    Hong, Ming
    [J]. 2020 THE 4TH INTERNATIONAL CONFERENCE ON MANAGEMENT ENGINEERING, SOFTWARE ENGINEERING AND SERVICE SCIENCES (ICMSS 2020), 2020, : 82 - 86