A High Performance Algorithm for Text Feature Automatic Selection

被引:0
|
作者
Dai, Jin [1 ]
He, Zhongshi [1 ]
Hu, Feng [2 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing 400030, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Inst Comp Sci & Technol, Chongqing 400065, Peoples R China
关键词
text classification; feature selection; cloud model; membership degree; dynamic clustering;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is an effective method for reducing the size of text feature space. So far, some effective methods for feature selection have been developed. For the purpose of acquiring the optimal number of features, these methods mainly depend on observation or experience. In this paper, by combining the overall with the local distribution of features in categories, a high performance algorithm for feature automation selection (Named FAS) is proposed. By using FAS, the feature set can be obtained automatically. Besides, it can effectively amend the distribution of features by using cloud model theory. Analysis and open experimental results show the selected feature set has fewer features and better classification performance than the existing methods.
引用
收藏
页码:414 / +
页数:2
相关论文
共 50 条
  • [31] An Algorithm of Feature Selection in Text Categorization Based on Gini-index
    Zhu, Wei-Dong
    Wang, Bo
    Lin, Yong-Min
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND MANAGEMENT INNOVATION, 2015, 6 : 272 - 278
  • [32] The Research Of Feature Selection Of Text Classification Based On Integrated Learning Algorithm
    Xia Huosong
    Liu Jian
    [J]. 2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 20 - 22
  • [33] Feature selection algorithm for text classification based on improved mutual information
    丛帅
    张积宾
    徐志明
    王宇颖
    [J]. Journal of Harbin Institute of Technology(New series), 2011, (03) : 144 - 148
  • [34] An Improved Information Gain Feature Selection Algorithm for SVM Text Classifier
    Xu, Jiamin
    Jiang, Hong
    [J]. 2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 273 - 276
  • [35] Lazy learner text categorization algorithm based on embedded feature selection
    Yan Peng~(1
    2.China State Information Center
    [J]. Journal of Systems Engineering and Electronics, 2009, 20 (03) : 651 - 659
  • [36] Hybrid particle swarm optimization algorithm for text feature selection problems
    Mourad Nachaoui
    Issam Lakouam
    Imad Hafidi
    [J]. Neural Computing and Applications, 2024, 36 : 7471 - 7489
  • [37] Hybrid particle swarm optimization algorithm for text feature selection problems
    Nachaoui, Mourad
    Lakouam, Issam
    Hafidi, Imad
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7471 - 7489
  • [38] Lazy learner text categorization algorithm based on embedded feature selection
    Yan Peng
    Zheng Xuefeng
    Zhu Jianyong
    Xiao Yunhong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2009, 20 (03) : 651 - 659
  • [39] Best terms: an efficient feature-selection algorithm for text categorization
    Fragoudis, D
    Meretakis, D
    Likothanassis, S
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (01) : 16 - 33
  • [40] A Study of Odds Ratio Algorithm Used for Local Text Feature Selection
    Li Gang
    Xia Chenxi
    [J]. SEVENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III: UNLOCKING THE FULL POTENTIAL OF GLOBAL TECHNOLOGY, 2008, : 1132 - 1137