Using clustering and co-training to boost classification performance

被引:3
|
作者
Kyriakopoulou, Antonia [1 ]
机构
[1] Univ Athens Econ Business, Dept Informat, GR-10434 Athens, Greece
关键词
D O I
10.1109/ICTAI.2007.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper shows that the performance of a linear SVM classifier can be improved by utilizing meta-information derived from clustering. Clustering aims in discovering extra knowledge concerning the structure of the whole dataset, (both training and testing set). A co-training algorithm is introduced that uses clustering as a complementary step to text classification. At each iteration step of the algorithm the clustering phase augments the feature space with a new meta-feature that for each document reflects cluster membership and the classification phase introduces another meta-feature that indicates class membership. Experimental results obtained using widely used datasets demonstrate the effectiveness of the proposed approaches especially for small training sets.
引用
收藏
页码:325 / 330
页数:6
相关论文
共 50 条
  • [21] A Co-Training Strategy for Multiple View Clustering in Process Mining
    Appice, Annalisa
    Malerba, Donato
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (06) : 832 - 845
  • [22] A subspace co-training framework for multi-view clustering
    Zhao, Xuran
    Evans, Nicholas
    Dugelay, Jean-Luc
    PATTERN RECOGNITION LETTERS, 2014, 41 : 73 - 82
  • [23] Efficient Classification via Partial Co-Training for Virtual Metrology
    Nguyen, Cuong
    Li, Xin
    Blanton, Shawn
    Li, Xiang
    2020 25TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2020, : 753 - 760
  • [24] Network traffic classification based on ensemble learning and co-training
    HaiTao He
    XiaoNan Luo
    FeiTeng Ma
    ChunHui Che
    JianMin Wang
    Science in China Series F: Information Sciences, 2009, 52 : 338 - 346
  • [25] Using Co-Training to Empower Active Learning
    Azad, Payam V.
    Yaslan, Yusuf
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [26] Bilingual Co-Training for Sentiment Classification of Chinese Product Reviews
    Wan, Xiaojun
    COMPUTATIONAL LINGUISTICS, 2011, 37 (03) : 587 - 616
  • [27] DCPE Co-Training: Co-Training Based on Diversity of Class Probability Estimation
    Xu, Jin
    He, Haibo
    Man, Hong
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [28] Network traffic classification based on ensemble learning and co-training
    He HaiTao
    Luo XiaoNan
    Ma FeiTeng
    Che ChunHui
    Wang JianMin
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (02): : 338 - 346
  • [29] Graph based co-training algorithm for web page classification
    Hou, Cui-Qin
    Jiao, Li-Cheng
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (10): : 2173 - 2180
  • [30] Network traffic classification based on ensemble learning and co-training
    HE HaiTao1
    2 Key Laboratory of Digital Life (Sun Yat-sen University)
    3 Information and Network Center
    Science China(Information Sciences), 2009, (02) : 338 - 346