Support-vector-based iteratively adjusted centroid classifier for text categorization

Cited by: 0
Authors
Wang, Deqing [1]
Zhang, Hui [1]
Affiliations
[1] State Key Laboratory of Software Development Environment, Beijing University of Aeronautics and Astronautics, Beijing 100191, China
Keywords
Text processing; Vectors; Iterative methods
DOI
Not available
Abstract
To address the weakness of the centroid-based classifier (CC), which is prone to inductive bias or model misfit, a support-vector-based iteratively adjusted centroid classifier (IACC_SV) is proposed. It uses support vectors found by some routine, e.g., a linear support vector machine (SVM), to construct the initial centroid vectors for CC, and then iteratively adjusts these centroid vectors according to the misclassified training samples. Extensive experiments on 8 real-world text corpora show that IACC_SV achieves better macro-F1 and micro-F1 than traditional classification algorithms, especially on text corpora with highly imbalanced classes.
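The two stages named in the abstract (centroids built from support vectors, then iterative adjustment on misclassified training samples) can be illustrated with a minimal pure-Python sketch. Everything specific here is an assumption, since the abstract does not give the details: the function names, the cosine-similarity decision rule, the learning rate `lr`, and above all the update rule, which pulls the true-class centroid toward a misclassified sample and pushes the wrongly predicted centroid away. The support-vector indices `sv_idx` are taken as given (in practice they might come from a linear SVM), and every class is assumed to have at least one support vector.

```python
import math

def normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def init_centroids(X, y, sv_idx):
    """Sum the support vectors of each class and normalize: one centroid per class."""
    dim = len(X[0])
    sums = {}
    for i in sv_idx:
        acc = sums.setdefault(y[i], [0.0] * dim)
        for j, value in enumerate(X[i]):
            acc[j] += value
    return {label: normalize(total) for label, total in sums.items()}

def predict(centroids, x):
    """Assign x to the class whose centroid has the highest cosine similarity."""
    xn = normalize(x)
    return max(sorted(centroids), key=lambda label: dot(centroids[label], xn))

def adjust(centroids, X, y, lr=0.1, max_iter=20):
    """Iteratively adjust centroids in place using misclassified training samples.

    ASSUMED update rule (not taken from the paper): pull the true-class
    centroid toward the sample and push the predicted centroid away.
    """
    for _ in range(max_iter):
        errors = 0
        for x, label in zip(X, y):
            guess = predict(centroids, x)
            if guess == label:
                continue
            errors += 1
            xn = normalize(x)
            centroids[label] = normalize(
                [c + lr * v for c, v in zip(centroids[label], xn)])
            centroids[guess] = normalize(
                [c - lr * v for c, v in zip(centroids[guess], xn)])
        if errors == 0:  # every training sample is classified correctly
            break
    return centroids

# Toy data: two classes in two dimensions; indices 0 and 2 play the role
# of hypothetical support vectors.
X = [[1.0, 0.0], [0.45, 0.55], [0.0, 1.0], [0.2, 0.8]]
y = ["a", "a", "b", "b"]
centroids = adjust(init_centroids(X, y, sv_idx=[0, 2]), X, y)
predictions = [predict(centroids, x) for x in X]
```

In this toy data the second sample lies closer to the initial class "b" centroid, so the adjustment loop is what moves the decision boundary; the number of passes needed depends on `lr`.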
Pages: 269-274