Two-Stage Feature Selection for Text Classification

Cited by: 7
Authors
Ozgur, Levent [1]
Gungor, Tunga [1]
Affiliations
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
Source
INFORMATION SCIENCES AND SYSTEMS 2015 | 2016 / Vol. 363
DOI
10.1007/978-3-319-22635-4_30
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
In this paper, we focus on feature coverage policies used for feature selection in the text classification domain. Two alternative policies are discussed and compared: corpus-based and class-based selection of features. We make a detailed analysis of pruning and keyword selection by varying the parameters of the policies and obtain the optimal usage patterns. In addition, by combining the optimal forms of these methods, we propose a novel two-stage feature selection approach. The experiments on three independent datasets showed that the proposed method results in a statistically significant increase over the traditional methods in the success rates of the classifier.
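The two coverage policies named in the abstract can be illustrated with a minimal sketch. The abstract does not state the scoring metric, the pruning threshold, or the number of keywords, so everything concrete here is an assumption: the score is a plain per-class document count used as a stand-in, and `prune`, `class_scores`, `corpus_based`, `class_based`, `min_df`, `k`, and `k_per_class` are hypothetical names, not the authors' implementation. Pruning runs first and keyword selection second, which is one plausible reading of "two-stage".

```python
from collections import Counter, defaultdict


def prune(docs, min_df=2):
    """Stage 1 (assumed): keep terms that occur in at least min_df documents."""
    df = Counter(term for tokens, _ in docs for term in set(tokens))
    return {term for term, n in df.items() if n >= min_df}


def class_scores(docs, vocab):
    """Stand-in score: how many documents of each class contain the term."""
    scores = defaultdict(Counter)  # label -> term -> score
    for tokens, label in docs:
        for term in set(tokens) & vocab:
            scores[label][term] += 1
    return scores


def corpus_based(scores, k):
    """Corpus-based policy: one global ranking, keep the k best terms."""
    total = Counter()
    for per_class in scores.values():
        total.update(per_class)  # sums the per-class scores
    ranked = sorted(total, key=lambda t: (-total[t], t))  # ties broken by name
    return set(ranked[:k])


def class_based(scores, k_per_class):
    """Class-based policy: rank inside each class, then take the union."""
    selected = set()
    for per_class in scores.values():
        ranked = sorted(per_class, key=lambda t: (-per_class[t], t))
        selected.update(ranked[:k_per_class])
    return selected


# Toy corpus, invented for illustration: (tokens, class label)
docs = [
    (["goal", "match", "team"], "sport"),
    (["goal", "team", "coach"], "sport"),
    (["match", "team", "goal"], "sport"),
    (["vote", "law", "team"], "politics"),
    (["vote", "law", "rare"], "politics"),
]

vocab = prune(docs, min_df=2)       # stage 1: pruning
scores = class_scores(docs, vocab)  # scoring on the pruned vocabulary
print(sorted(corpus_based(scores, k=2)))
print(sorted(class_based(scores, k_per_class=1)))
```

On this toy corpus the global ranking is dominated by the larger class, while the per-class ranking guarantees each class at least one keyword. That coverage difference is what the two policies trade off.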
Pages: 329-337
Page count: 9