Optimized Approach of Feature Selection based on Information Gain

被引:10
|
作者
Wu, Guohua [1 ]
Xu, Junjun [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
关键词
text classification; feature selection; information gain; SVM;
D O I
10.1109/CSMA.2015.38
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text feature selection is the key technology in text classification and text information retrieval. The feature selection method - information gain - has extensive application in text categorization. This paper theoretically analyzed the deficiency of information gain in feature selection methods, and then introduced two improvement factors which were LDFWF (Limiting Document Frequency's Word Frequency) and DI (Distribution Information), on this basis an improved text feature selection method was proposed. In this paper, the experiments used the SVM classifier for text classification, text feature selection methods respectively used information gain and the improved information gain that this paper proposed, the results show that the method effectively improve the accuracy of text classification.
引用
收藏
页码:157 / 161
页数:5
相关论文
共 50 条
  • [1] Gabor Feature Selection Based on Information Gain
    Lefkovits, Szidonia
    Lefkovits, Laszlo
    [J]. 10TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2016, 2017, 181 : 892 - 898
  • [2] Assessment of Sentiment Analysis Using Information Gain Based Feature Selection Approach
    Madhumathi, R.
    Kowshalya, A. Meena
    Shruthi, R.
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (02): : 849 - 860
  • [3] An Improved Feature Selection Method Based on Information Gain
    Li, Yanling
    Sun, Wenxia
    [J]. INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING BIOMEDICAL ENGINEERING, AND INFORMATICS (SPBEI 2013), 2014, : 530 - 535
  • [4] Acoustic Based Approach of Sewer Blockage Recognition Using Information Gain for Feature Selection
    Zhu, Xuefeng
    Feng, Zao
    Wu, Jiande
    Ma, Jun
    [J]. Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2021, 41 (02): : 267 - 274
  • [5] A Hybrid Feature Selection Approach for Parkinson’s Detection Based on Mutual Information Gain and Recursive Feature Elimination
    Rohit Lamba
    Tarun Gulati
    Anurag Jain
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 10263 - 10276
  • [6] A Hybrid Feature Selection Approach for Parkinson's Detection Based on Mutual Information Gain and Recursive Feature Elimination
    Lamba, Rohit
    Gulati, Tarun
    Jain, Anurag
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 10263 - 10276
  • [7] Research on Text Feature Selection Algorithm Based on Information Gain and Feature Relation Tree
    Zhang, Hong
    Ren, Yong-gong
    Yang, Xue
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 446 - 449
  • [8] A Novel Feature Selection Based on VMD and Information Gain for Pipe Blockages
    Zhu, Xuefeng
    Feng, Zao
    Wu, Jiande
    Deng, Weiquan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [9] A Developed Feature Selection Method for Classification Based on United Information Gain
    Niu, Kun
    Jiao, Haizhen
    Gao, Zhipeng
    Jia, Guannan
    Yang, Guangyu
    Cheng, Cheng
    [J]. 2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [10] On the Feature Selection and Classification Based on Information Gain for Document Sentiment Analysis
    Pratiwi, Asriyanti Indah
    Adiwijaya
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2018, 2018