Chinese text categorization based on CCIPCA and SMO

被引:1
|
作者
Li, Xin-Fu [1 ]
He, Hai-Bin [1 ]
Zhao, Lei-Lei [1 ]
机构
[1] Hebei Univ, Coll Math & Comp Sci, Baoding 071002, Peoples R China
关键词
text categorization; dimension reduction; candid incremental principal component analysis (CCIPCA); sequential minimization optimization algorithm (SMO);
D O I
10.1109/ICMLC.2008.4620831
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Vector space model is usually used to express text for text categorization. How to reduce the dimensionality of feature space is a very key problem for practical text classification. The classical decomposition algorithms are incapable of dealing with the high-dimensional and large-scale text categorization problems. In this paper an approach to improving the performance of text categorization is presented by using candid incremental principal component analysis and sequential minimization optimization algorithm. The experimental result shows that the proposed method for Chinese text categorization is practicable and effective.
引用
收藏
页码:2514 / 2518
页数:5
相关论文
共 50 条
  • [1] Research on Chinese Text Automatic Categorization Based on VSM
    Tong Xiao-Jun
    Cui Ming-Gen
    Song Guo-Long
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 3863 - +
  • [2] Chinese Text Categorization Based on Deep Belief Networks
    Song, Jia
    Qin, Sijun
    Zhang, Pengzhou
    [J]. 2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1123 - 1127
  • [3] Chinese text categorization based on fuzzy association rules
    Yuan, Fang
    Guo, Yu-Qin
    Yang, Liu
    Yang, Fan
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1030 - +
  • [4] CHINESE TEXT CATEGORIZATION STUDY BASED ON FEATURE WEIGHT LEARNING
    Zhan, Yan
    Chen, Hao
    Zhang, Su-Fang
    Zheng, Mei
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 1723 - +
  • [5] Automatic Chinese Text Categorization System Based on Mutual Information
    Lu, Zhimao
    Shi, Hong
    Zhang, Qi
    Yuan, Chaoyue
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 4986 - 4990
  • [6] A Fast Algorithm for Chinese Text Categorization Based on Key Tree
    Liu Xin
    Liu Renren
    He Wenjing
    [J]. INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS II, PTS 1-3, 2011, 58-60 : 1106 - +
  • [7] Design of Chinese Text Categorization Classifier Based on Attribute Bagging
    Zhang, Xiang
    Zhou, Mingquan
    Dong, Lili
    Ye, Na
    [J]. 2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 201 - 204
  • [8] Examples Initialization in Chinese Text Categorization
    Cheng, Shi
    Shi, Yuhui
    Qin, Quande
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 967 - 971
  • [9] Novel feature selection algorithm for Chinese text categorization based on CHI
    Cai Zhenliang
    Wang Jian
    Liu Jiqiang
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1035 - 1039
  • [10] A logistic regression-based smoothing method for Chinese text categorization
    Yen, Show-Jane
    Lee, Yue-Shi
    Ying, Jia-Ching
    Wu, Yu-Chieh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11581 - 11590