Chinese text categorization based on CCIPCA and SMO

Cited by: 1
Authors
Li, Xin-Fu [1 ]
He, Hai-Bin [1 ]
Zhao, Lei-Lei [1 ]
Affiliation
[1] Hebei Univ, Coll Math & Comp Sci, Baoding 071002, Peoples R China
Keywords
text categorization; dimension reduction; candid incremental principal component analysis (CCIPCA); sequential minimization optimization algorithm (SMO)
DOI
10.1109/ICMLC.2008.4620831
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The vector space model is commonly used to represent text for text categorization. Reducing the dimensionality of the feature space is a key problem in practical text classification. Classical decomposition algorithms cannot handle high-dimensional, large-scale text categorization problems. This paper presents an approach to improving text categorization performance that combines candid incremental principal component analysis (CCIPCA) with the sequential minimization optimization algorithm (SMO). The experimental results show that the proposed method is practicable and effective for Chinese text categorization.
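
The dimension-reduction step named in the abstract can be illustrated with a minimal sketch of an incremental principal component update in the style of CCIPCA. This is not the authors' implementation: the function names `ccipca` and `project`, the `amnesic` parameter default, and the assumption that input vectors are already mean-centered are all choices made here for illustration. The reduced vectors returned by `project` would then be passed to a classifier such as an SVM trained with SMO, which is not shown.

```python
import math
import random


def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def ccipca(samples, k, amnesic=0.0):
    """Estimate the first k principal directions one sample at a time.

    Assumes the samples are already mean-centered (an assumption of this
    sketch). Returns k unit-length direction estimates.
    """
    estimates = []  # unnormalised direction estimates v_1 .. v_k
    for n, sample in enumerate(samples, start=1):
        residual = list(sample)
        for i in range(min(k, n)):
            if i == n - 1:
                # First time this component is seen: initialise it
                # with the current residual.
                estimates.append(residual[:])
            else:
                v = estimates[i]
                norm = math.sqrt(dot(v, v)) or 1e-12
                proj = dot(residual, v) / norm
                w_old = (n - 1 - amnesic) / n
                w_new = (1 + amnesic) / n
                # Weighted average of the old estimate and the new
                # sample scaled by its projection on the old estimate.
                estimates[i] = [w_old * vj + w_new * proj * rj
                                for vj, rj in zip(v, residual)]
            # Deflate: remove the part of the residual explained by
            # component i before updating component i + 1.
            v = estimates[i]
            norm = math.sqrt(dot(v, v)) or 1e-12
            unit = [vj / norm for vj in v]
            coeff = dot(residual, unit)
            residual = [rj - coeff * uj for rj, uj in zip(residual, unit)]
    result = []
    for v in estimates:
        norm = math.sqrt(dot(v, v)) or 1e-12
        result.append([vj / norm for vj in v])
    return result


def project(sample, components):
    """Map a high-dimensional vector to its low-dimensional coordinates."""
    return [dot(sample, c) for c in components]


if __name__ == "__main__":
    random.seed(0)
    # Synthetic stand-in for term vectors: most variance on the first axis.
    data = [[random.gauss(0, 5), random.gauss(0, 1), random.gauss(0, 0.2)]
            for _ in range(2000)]
    components = ccipca(data, k=2)
    print(project(data[0], components))
```

Because each sample is processed once and no covariance matrix is formed, memory use grows with k times the feature dimension rather than with the square of the feature dimension, which is the property that makes this family of methods attractive for large vocabularies.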
Pages: 2514-2518
Page count: 5