Chinese text categorization based on CCIPCA and SMO

Cited by: 1
Authors
Li, Xin-Fu [1]
He, Hai-Bin [1]
Zhao, Lei-Lei [1]
Affiliation
[1] Hebei Univ, Coll Math & Comp Sci, Baoding 071002, Peoples R China
Keywords
text categorization; dimension reduction; candid covariance-free incremental principal component analysis (CCIPCA); sequential minimal optimization algorithm (SMO)
DOI
10.1109/ICMLC.2008.4620831
Chinese Library Classification (CLC) number
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
The vector space model is commonly used to represent text for text categorization. Reducing the dimensionality of the feature space is a key problem in practical text classification. Classical decomposition algorithms cannot handle high-dimensional, large-scale text categorization problems. This paper presents an approach to improving text categorization performance that uses candid covariance-free incremental principal component analysis (CCIPCA) and the sequential minimal optimization (SMO) algorithm. The experimental results show that the proposed method for Chinese text categorization is practicable and effective.
Pages: 2514-2518
Number of pages: 5