Using clustering and co-training to boost classification performance

被引:3
|
作者
Kyriakopoulou, Antonia [1 ]
机构
[1] Univ Athens Econ Business, Dept Informat, GR-10434 Athens, Greece
关键词
D O I
10.1109/ICTAI.2007.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper shows that the performance of a linear SVM classifier can be improved by utilizing meta-information derived from clustering. Clustering aims in discovering extra knowledge concerning the structure of the whole dataset, (both training and testing set). A co-training algorithm is introduced that uses clustering as a complementary step to text classification. At each iteration step of the algorithm the clustering phase augments the feature space with a new meta-feature that for each document reflects cluster membership and the classification phase introduces another meta-feature that indicates class membership. Experimental results obtained using widely used datasets demonstrate the effectiveness of the proposed approaches especially for small training sets.
引用
收藏
页码:325 / 330
页数:6
相关论文
共 50 条
  • [1] DCPE co-training for classification
    Xu, Jin
    He, Haibo
    Man, Hong
    [J]. NEUROCOMPUTING, 2012, 86 : 75 - 85
  • [2] Clustering with Extended Constraints by Co-Training
    Okabe, Masayuki
    Yamada, Seiji
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 79 - 82
  • [3] Web classification of conceptual entities using co-training
    Sun, Aixin
    Liu, Ying
    Lim, Ee-Peng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 14367 - 14375
  • [4] Co-training with Clustering for the Semi-supervised Classification of Remote Sensing Images
    Aydav, Prem Shankar Singh
    Minz, Sonjharia
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 2, 2016, 380 : 659 - 667
  • [5] Vertical Ensemble Co-Training for Text Classification
    Katz, Gilad
    Caragea, Cornelia
    Shabtai, Asaf
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2018, 9 (02)
  • [6] Traffic Classification Using En-semble Learning and Co-training
    He, Haitao
    Che, Chunhui
    Ma, Feiteng
    Zhang, Jun
    Luo, Xiaonan
    [J]. PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATIONS, PTS I AND II: NEW ASPECTS OF APPLIED INFORMATICS AND COMMUNICATIONS, 2008, : 458 - +
  • [7] CURL: Image Classification using co-training and Unsupervised Representation Learning
    Bianco, Simone
    Ciocca, Gianluigi
    Cusano, Claudio
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 145 : 15 - 29
  • [8] High performance query expansion using adaptive co-training
    Huang, Jimmy Xiangji
    Miao, Jun
    He, Ben
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (02) : 441 - 453
  • [9] HIGH ACCURATE INTERNET TRAFFIC CLASSIFICATION BASED ON CO-TRAINING SEMI-SUPERVISED CLUSTERING
    Li, Xiang
    Qi, Feng
    Yu, Li Kun
    Qiu, Xue Song
    [J]. PROCEEDINGS OF THE 2010 INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENCE AND AWARENESS INTERNET, AIAI2010, 2010, : 193 - 197
  • [10] Co-training of Feature Extraction and Classification using Partitioned Convolutional Neural Networks
    Tsai, Wei-Yu
    Choi, Jinhang
    Parija, Tulika
    Gomatam, Priyanka
    Das, Chita
    Sampson, John
    Narayanan, Vijaykrishnan
    [J]. PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,