Using clustering and co-training to boost classification performance

被引:3
|
作者
Kyriakopoulou, Antonia [1 ]
机构
[1] Univ Athens Econ Business, Dept Informat, GR-10434 Athens, Greece
关键词
D O I
10.1109/ICTAI.2007.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper shows that the performance of a linear SVM classifier can be improved by utilizing meta-information derived from clustering. Clustering aims in discovering extra knowledge concerning the structure of the whole dataset, (both training and testing set). A co-training algorithm is introduced that uses clustering as a complementary step to text classification. At each iteration step of the algorithm the clustering phase augments the feature space with a new meta-feature that for each document reflects cluster membership and the classification phase introduces another meta-feature that indicates class membership. Experimental results obtained using widely used datasets demonstrate the effectiveness of the proposed approaches especially for small training sets.
引用
收藏
页码:325 / 330
页数:6
相关论文
共 50 条
  • [31] Network traffic classification based on ensemble learning and co-training
    HE HaiTaoLUO XiaoNanMA FeiTengCHE ChunHui WANG JianMin School of Information Science and TechnologySun YatSen UniversityGuangzhou China Key Laboratory of Digital Life Sun Yatsen UniversityMinistry of EducationGuangzhou China Information and Network CenterSun YatSen UniversityGuangzhou China
    ScienceinChina(SeriesF:InformationSciences), 2009, 52 (02) : 338 - 346
  • [32] Semi-Supervised Classification with Co-training for Deep Web
    Fang Wei
    Cui Zhiming
    ADVANCED MEASUREMENT AND TEST, PARTS 1 AND 2, 2010, 439-440 : 183 - +
  • [33] Catalyst: Combining Co-training and Active Learning for Lifelong Classification
    Ryndin, Maxim A.
    Turdakov, Denis Y.
    Kuznetsov, Sergey D.
    2020 IVANNIKOV ISPRAS OPEN CONFERENCE (ISPRAS 2020), 2020, : 96 - 101
  • [34] Spatial co-training for semi-supervised image classification
    Hong, Yi
    Zhu, Weiping
    PATTERN RECOGNITION LETTERS, 2015, 63 : 59 - 65
  • [35] Deep Co-Training Active Learning for Mammographic Images Classification
    Yang, Zhikai
    Wu, Wei
    Zhang, Jingyang
    Zhao, Yu
    Gu, Lixu
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1059 - 1062
  • [36] Bayesian Co-Training
    Yu, Shipeng
    Krishnapuram, Balaji
    Rosales, Romer
    Rao, R. Bharat
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 2649 - 2680
  • [37] A Co-training Approach for Multi-view Density Peak Clustering
    Ling, Yu
    He, Jinrong
    Ren, Silin
    Pan, Heng
    He, Guoliang
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 503 - 513
  • [38] Multiview Subspace Clustering via Co-Training Robust Data Representation
    Liu, Jiyuan
    Liu, Xinwang
    Yang, Yuexiang
    Guo, Xifeng
    Kloft, Marius
    He, Liangzhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5177 - 5189
  • [39] ROBUST CO-TRAINING
    Sun, Shiliang
    Jin, Feng
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (07) : 1113 - 1126
  • [40] Semi-supervised hyperspectral classification from a small number of training samples using a co-training approach
    Romaszewski, Michal
    Glomb, Przemyslaw
    Cholewa, Michal
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 121 : 60 - 76