Text categorization based on granular agent evolutionary classification algorithm

被引:0
|
作者
Pan X. [1 ]
Chen H. [1 ]
Jing Z. [2 ]
机构
[1] School of Computer Science and Technology, Xi'An University of Post and Telecommunications, Xi'an, Shaanxi
[2] Xiamen Ulab Network Technology Co., Ltd., Xiamen, Fujian
基金
中国国家自然科学基金;
关键词
Category Information; Evolution; Feature Selection; Granular Agent; Term Frequency; Text Categorization;
D O I
10.1166/jctn.2016.5059
中图分类号
学科分类号
摘要
Document classification, with the blooming of the Internet information delivery, has become indispensable required and is expected to be disposed by an automatic text categorization. This paper presents a text categorization approach based on granular agent evolutionary classification algorithm to the single-labeled documents. First, a new feature selection method combined term frequency with class information is proposed according to the analyses of existed approaches. It based on the term weighting scheme, and using some useful information in other feature selection. Second, inspiration of the ideas in granular agent evolutionary classification algorithm, a new classifier is introduced in the classifying module. It causes the evolution of sets of documents, and at the end of the evolutionary process, extracts rules from these sets. Because the particularity in text categorization, some specific operators are devised for realizing the evolutionary operations performed on granular agent. Assimilation operator, exchange operator, and differentiation operator reflect the competitive, cooperative and self-learning ability of agent respectively. In experiments, the effectiveness of the proposed approach is evaluated in Reuters-21578. The test results show that the algorithm has a good recall, precision and F1 measure. In most categories, the performance of it is better than Naïve Bayes, K-nearest neighbor and support vector machine, which have good performance on the text categorization. All the results show the proposed algorithm is good. © 2016 American Scientific Publishers All rights reserved.
引用
下载
收藏
页码:1391 / 1398
页数:7
相关论文
共 50 条
  • [21] KNN Text Categorization Algorithm Based on Semantic Centre
    Zhang Xiao-fei
    Huang He-yan
    Zhang Ke-liang
    2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 249 - +
  • [22] Text categorization with KNN algorithm
    Zhang, Ning
    Jia, Ziyan
    Shi, Zhongzhi
    Jisuanji Gongcheng/Computer Engineering, 2005, 31 (08): : 171 - 172
  • [23] An algorithm for text categorization with SVM
    Hu, J
    Huang, HK
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 47 - 50
  • [24] Knowledge Evolutionary Algorithm Based on Granular Computing
    Tao, Yong-Qin
    Cui, Du-Wu
    Yan, Tai-Shan
    2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 1230 - 1235
  • [25] A Granular Evolutionary Algorithm Based on Cultural Evolution
    Meng, Zuqiang
    Shi, Zhongzhi
    ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2007, 4683 : 147 - +
  • [26] A new Centroid-Based Classification model for text categorization
    Liu, Chuan
    Wang, Wenyong
    Tu, Guanghui
    Xiang, Yu
    Wang, Siyang
    Lv, Fengmao
    KNOWLEDGE-BASED SYSTEMS, 2017, 136 : 15 - 26
  • [27] A New Fuzzy Hierarchical Classification Based on SVM for Text Categorization
    Guernine, Taoufik
    Zeroual, Kacem
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2009, 5627 : 865 - 874
  • [28] Text categorization based on classification rules tree by frequent patterns
    Department of Computer and Information Technology, Fudan University, Shanghai 200433, China
    不详
    Ruan Jian Xue Bao, 2006, 5 (1017-1025):
  • [29] Adapting Associative Classification to Text Categorization
    Li, Baoli
    Sugandh, Neha
    Garcia, Ernest V.
    Ram, Ashwin
    DOCENG'07: PROCEEDINGS OF THE 2007 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2007, : 205 - 207
  • [30] A Text Classification Algorithm Based On RS
    Li, Jianlin
    PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTING TECHNOLOGY (ICMMCT 2017), 2017, 126 : 105 - 108