Text categorization based on granular agent evolutionary classification algorithm

被引:0
|
作者
Pan X. [1 ]
Chen H. [1 ]
Jing Z. [2 ]
机构
[1] School of Computer Science and Technology, Xi'An University of Post and Telecommunications, Xi'an, Shaanxi
[2] Xiamen Ulab Network Technology Co., Ltd., Xiamen, Fujian
基金
中国国家自然科学基金;
关键词
Category Information; Evolution; Feature Selection; Granular Agent; Term Frequency; Text Categorization;
D O I
10.1166/jctn.2016.5059
中图分类号
学科分类号
摘要
Document classification, with the blooming of the Internet information delivery, has become indispensable required and is expected to be disposed by an automatic text categorization. This paper presents a text categorization approach based on granular agent evolutionary classification algorithm to the single-labeled documents. First, a new feature selection method combined term frequency with class information is proposed according to the analyses of existed approaches. It based on the term weighting scheme, and using some useful information in other feature selection. Second, inspiration of the ideas in granular agent evolutionary classification algorithm, a new classifier is introduced in the classifying module. It causes the evolution of sets of documents, and at the end of the evolutionary process, extracts rules from these sets. Because the particularity in text categorization, some specific operators are devised for realizing the evolutionary operations performed on granular agent. Assimilation operator, exchange operator, and differentiation operator reflect the competitive, cooperative and self-learning ability of agent respectively. In experiments, the effectiveness of the proposed approach is evaluated in Reuters-21578. The test results show that the algorithm has a good recall, precision and F1 measure. In most categories, the performance of it is better than Naïve Bayes, K-nearest neighbor and support vector machine, which have good performance on the text categorization. All the results show the proposed algorithm is good. © 2016 American Scientific Publishers All rights reserved.
引用
下载
收藏
页码:1391 / 1398
页数:7
相关论文
共 50 条
  • [31] Text Categorization as a Graph Classification Problem
    Rousseau, Francois
    Kiagias, Emmanouil
    Vazirgiannis, Michalis
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1702 - 1712
  • [32] Reference algorithm of text categorization based on fuzzy cognitive maps
    Zhang Guiyun
    Liu Yang
    Zhang Weijuan
    Wang Yuanyuan
    INTELLIGENT INFORMATION PROCESSING III, 2006, 228 : 531 - +
  • [33] An improved kNN text categorization algorithm based on cluster distribution
    Luo, Yuansheng
    Wang, Minweng
    Le, Zhongjian
    Zhang, Huawei
    Journal of Computational Information Systems, 2012, 8 (03): : 1255 - 1263
  • [34] A Fast Algorithm for Chinese Text Categorization Based on Key Tree
    Liu Xin
    Liu Renren
    He Wenjing
    INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS II, PTS 1-3, 2011, 58-60 : 1106 - +
  • [35] The Research of kNN Text Categorization Algorithm Based On Eager Learning
    Dong, Tao
    Cheng, Weinan
    Shang, Wenqian
    2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1120 - 1123
  • [36] Multi-class Text Categorization Based on Immune Algorithm
    Zhang, Qirui
    Luo, Man
    Xue, Yonggang
    Tan, Jinghua
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 749 - +
  • [37] Text categorization algorithm based on feature order pair quantization
    Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    Qinghua Daxue Xuebao, 2006, 4 (527-529+533):
  • [38] A fast KNN algorithm for text categorization
    Wang, Yu
    Wang, Zheng-Ou
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3436 - +
  • [39] A simple KNN algorithm for text categorization
    Soucy, P
    Mineau, GW
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 647 - 648
  • [40] Using KNN Algorithm for Text Categorization
    Wajeed, M. A.
    Adilakshmi, T.
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 796 - +