Traditional and Swarm Intelligent Based Text Feature Selection for Document Classification

被引:0
|
作者
Kyaw, Khin Sandar [1 ]
Limsiroratana, Somchai [1 ]
机构
[1] Prince Songkla Univ, Dept Comp Engn, Hat Yai, Thailand
关键词
Text mining; Meta-heuristic algorithms; Feature reduction; J48; Classifier; Correlation-based feature selection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the age of information, the role of text feature selection has become a hot challenge for building automatic document classification models in various application areas. Meanwhile, various powerful schemes dealing with text feature selection are being developed continuously nowadays, but there still exists a research gap for "feature optimization problem": looking for the optimal features by taking account not only local optimism but also global optimism by avoiding the bias problem for unbalance and high dimension text feature. In this paper, we observed swarm-based searching policy which includes Ant Colony Optimization (ACO) and Artificial Bee Colony (ABC) to overcome the local optimization problem of high dimensional feature selection for BBC News document classification. Then, we compared the results using various evaluation measurements for performance and computation cost. The experimental results show that the selected feature using swarm intelligence search is more robust for classification results.
引用
收藏
页码:226 / 231
页数:6
相关论文
共 50 条
  • [1] Weighted Document Frequency for Feature Selection in Text Classification
    Li, Baoli
    Yan, Qiuling
    Xu, Zhenqiang
    Wang, Guicai
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 132 - 135
  • [2] Improved Document Feature Selection with Categorical Parameter for Text Classification
    Wang, Fen
    Li, Xiaoxuan
    Huang, Xiaotao
    Kang, Ling
    [J]. MOBILE, SECURE, AND PROGRAMMABLE NETWORKING (MSPN 2016), 2016, 10026 : 86 - 98
  • [3] Feature selection for document classification based on topology
    El Barbary, O. G.
    Salama, A. S.
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2018, 19 (02) : 129 - 132
  • [4] A REVIEW OF TRADITIONAL AND SWARM SEARCH BASED FEATURE SELECTION ALGORITHMS FOR HANDLING DATA STREAM CLASSIFICATION
    Sasikala, S.
    Devi, D. Renuka
    [J]. 2017 IEEE 3RD INTERNATIONAL CONFERENCE ON SENSING, SIGNAL PROCESSING AND SECURITY (ICSSS), 2017, : 514 - 520
  • [5] OPTIMAL FEATURE SUBSET SELECTION BASED ON COMBINING DOCUMENT FREQUENCY AND TERM FREQUENCY FOR TEXT CLASSIFICATION
    Karpagalingam, Thirumoorthy
    Karuppaiah, Muneeswaran
    [J]. COMPUTING AND INFORMATICS, 2020, 39 (05) : 881 - 906
  • [6] Optimal feature subset selection based on combining document frequency and term frequency for text classification
    Karpagalingam, Thirumoorthy
    Karuppaiah, Muneeswaran
    [J]. 1600, Slovak Academy of Sciences (39): : 881 - 906
  • [7] Text document classification using swarm intelligence
    Vizine, AL
    de Castro, LN
    Gudwin, RR
    [J]. 2005 INTERNATIONAL CONFERENCE ON INTEGRATION OF KNOWLEDGE INTENSIVE MULTI-AGENT SYSTEMS: KIMAS'05: MODELING, EXPLORATION, AND ENGINEERING, 2005, : 134 - 139
  • [8] Feature Selection in Text Classification
    Sahin, Durmus Ozkan
    Ates, Nurullah
    Kilic, Erdal
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1777 - 1780
  • [9] Utility-based feature selection for text classification
    Heyong Wang
    Ming Hong
    Raymond Yiu Keung Lau
    [J]. Knowledge and Information Systems, 2019, 61 : 197 - 226
  • [10] Utility-based feature selection for text classification
    Wang, Heyong
    Hong, Ming
    Lau, Raymond Yiu Keung
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (01) : 197 - 226