An Improved Algorithm for Multiclass Text Categorization with Support Vector Machine

被引:0
|
作者
Shao, Fubo [1 ]
He, Guoping [1 ]
Zhang, Xin [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao 266510, Peoples R China
关键词
D O I
10.1109/ISCID.2008.152
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated text categorization is attractive because it frees organizations from the need of manually organizing document bases. Support Vector Machine (SVM) is an efficient technique for text categorization. Computing kernel matrix is the key in text categorization with SVM. When the kind of texts is large, the matrix of texts will become sparse. If we compute the kernel matrix directly, it will waste much time and memory space. To save time, the paper explored the hash function in the process of computing the kernel matrix. Then we propose an improved algorithm for multiclass text categorization. The paper also gives the good property of the improved algorithm from the theoretical and experimental aspects. We compared the improved algorithm with the original algorithm. Experiment shows that the improved algorithm can save much computational time.
引用
收藏
页码:336 / 339
页数:4
相关论文
共 50 条
  • [1] Automated text categorization using support vector machine
    Kwok, JTY
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 347 - 351
  • [2] A new transductive support vector machine approach to text categorization
    Sun, F
    Sun, MS
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 631 - 635
  • [3] Exploring Feature Selection and Support Vector Machine in Text Categorization
    Abdul-Rahman, Shuzlina
    Mutalib, Sofianita
    Khanafi, Nur Amira
    Ali, Azliza Mohd
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1101 - 1104
  • [4] Application for Web Text Categorization Based on Support Vector Machine
    Pan Hao
    Duan Ying
    Tan Longyuan
    [J]. 2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 42 - 45
  • [5] An improved algorithm of the support vector machine
    Du Hongle
    [J]. 2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1315 - 1317
  • [6] Fuzzy support vector machine for multi-class text categorization
    Wang, Tai-Yue
    Chiang, Huei-Min
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (04) : 914 - 929
  • [7] A method of Chinese text categorization based on proximal support vector machine
    Zhou, JG
    Wang, K
    Wu, J
    Yan, PL
    Wu, M
    [J]. PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 1615 - 1619
  • [8] Early detection of gradual concept drifts by text categorization and Support Vector Machine techniques: The TRIO algorithm
    Marseguerra, M.
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2014, 129 : 1 - 9
  • [9] Structural improved regular simplex support vector machine for multiclass classification
    Tang, Long
    Tian, Yingjie
    Li, Wenjun
    Pardalos, Panos M.
    [J]. APPLIED SOFT COMPUTING, 2020, 91
  • [10] Multiclass Lagrangian support vector machine
    Jae Pil Hwang
    Baehoon Choi
    In Wha Hong
    Euntai Kim
    [J]. Neural Computing and Applications, 2013, 22 : 703 - 710