Tri-training and data editing based semi-supervised clustering algorithm

Cited by: 0
Authors
Deng, Chao [1 ]
Guo, Mao Zu [1 ]
Affiliation
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Postfach 15 00 01, Harbin, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Seed-based semi-supervised clustering algorithms typically use a seed set consisting of a small amount of labeled data to initialize the cluster centroids, and thereby improve clustering performance over the whole data set. Research indicates that both the size and the quality of the seed set strongly limit the performance of semi-supervised clustering. A novel semi-supervised clustering algorithm named DE-Tri-training semi-supervised K-means is proposed. In the new algorithm, before the cluster centroids are initialized, the training process of a semi-supervised classification approach named Tri-training is used to label unlabeled data and add them to the initial seeds, enlarging the seed set. Meanwhile, to improve the quality of the enlarged seed set, a nearest-neighbor-rule-based data editing technique named Depuration is introduced into the Tri-training process to eliminate or correct noisy and mislabeled data among the enlarged seeds. Experiments show that the new algorithm can effectively improve the initialization of the cluster centroids and enhance clustering performance.
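The two ideas the abstract combines, nearest-neighbor editing of the seed set and seed-based centroid initialization, can be illustrated with a minimal sketch. This is not the authors' implementation: the Tri-training step is omitted entirely (the enlarged seed set is assumed to be given), and the function names `depuration` and `seeded_kmeans`, the parameters `k` and `k_prime`, and the toy data are all hypothetical choices made for illustration. The editing rule shown is one common reading of Depuration: a seed is relabeled to the class held by at least `k_prime` of its `k` nearest neighbors, and dropped if no class reaches that count.

```python
import numpy as np

def depuration(X, y, k=3, k_prime=2):
    """Assumed Depuration-style editing: relabel each point to the class
    held by at least k_prime of its k nearest neighbors, else drop it."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    # Pairwise Euclidean distances; a point is never its own neighbor.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)
    keep, new_labels = [], []
    for i in range(len(X)):
        nn = np.argsort(dist[i])[:k]
        classes, counts = np.unique(y[nn], return_counts=True)
        best = counts.argmax()
        if counts[best] >= k_prime:
            keep.append(i)
            new_labels.append(classes[best])
    return X[keep], np.array(new_labels)

def seeded_kmeans(X, seeds_X, seeds_y, n_iter=100):
    """K-means whose initial centroids are the per-class means of the seeds.
    Assumes every class still has at least one seed after editing."""
    X = np.asarray(X, dtype=float)
    seeds_X = np.asarray(seeds_X, dtype=float)
    seeds_y = np.asarray(seeds_y)
    classes = np.unique(seeds_y)
    centroids = np.stack([seeds_X[seeds_y == c].mean(axis=0) for c in classes])
    for _ in range(n_iter):
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        assign = d.argmin(axis=1)
        # Keep the old centroid if a cluster happens to receive no points.
        new = np.stack([
            X[assign == j].mean(axis=0) if np.any(assign == j) else centroids[j]
            for j in range(len(classes))
        ])
        if np.allclose(new, centroids):
            break
        centroids = new
    # Final assignment against the final centroids.
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return classes[d.argmin(axis=1)], centroids

# Toy seed set: two well-separated groups, with one seed near the first
# group (index 4) carrying the wrong label.
seed_X = np.array([[0, 0], [0, 1], [1, 0], [1, 1], [0.5, 0.5],
                   [10, 10], [10, 11], [11, 10]], dtype=float)
seed_y = np.array([0, 0, 0, 0, 1, 1, 1, 1])
unlabeled = np.array([[0.2, 0.3], [0.8, 0.9], [10.5, 10.2], [9.8, 10.7]])

clean_X, clean_y = depuration(seed_X, seed_y, k=3, k_prime=2)
labels, centroids = seeded_kmeans(np.vstack([clean_X, unlabeled]), clean_X, clean_y)
unlabeled_labels = labels[len(clean_X):]
```

In this toy run the mislabeled seed is corrected rather than removed, so it no longer pulls the initial centroid of class 1 toward the other group. How the editing step is interleaved with the Tri-training rounds is not specified in the abstract and is not modeled here.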
Pages: 641 / +
Page count: 2
Related papers
50 in total
  • [1] Tri-training and data editing based semi-supervised clustering algorithm
    Deng, Chao
    Guo, Mao-Zu
    [J]. Ruan Jian Xue Bao/Journal of Software, 2008, 19 (03): : 663 - 673
  • [2] A Novel Semi-supervised SVM Based on Tri-training
    Li, KunLun
    Zhang, Wei
    Ma, Xiaotao
    Cao, Zheng
    Zhang, Chao
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL III, PROCEEDINGS, 2008, : 47 - +
  • [3] Semi-supervised active learning algorithm for SVMs based on QBC and tri-training
    Hailong Xu
    Longyue Li
    Pengsong Guo
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 8809 - 8822
  • [4] Semi-supervised active learning algorithm for SVMs based on QBC and tri-training
    Xu, Hailong
    Li, Longyue
    Guo, Pengsong
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (09) : 8809 - 8822
  • [5] Classification of Hyperspectral Data Based on Semi-supervised Tri-training Learning Framework
    Huang, Rui
    Zhou, Lina
    [J]. ADVANCED MATERIALS IN MICROWAVES AND OPTICS, 2012, 500 : 374 - 382
  • [6] SEMI-SUPERVISED ACOUSTIC EVENT DETECTION BASED ON TRI-TRAINING
    Shi, Bowen
    Sun, Ming
    Kao, Chieh-Chi
    Rozgic, Viktor
    Matsoukas, Spyros
    Wang, Chao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 750 - 754
  • [7] Semi-supervised PolSAR Classification Based on Improved Tri-training
    Hua, Wenqiang
    Wang, Shuang
    Zhao, Yang
    Yue, Bo
    Guo, Yanhe
    [J]. 2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 3937 - 3940
  • [8] Deep Tri-Training for Semi-Supervised Image Segmentation
    An, Shan
    Zhu, Haogang
    Zhang, Jiaao
    Ye, Junjie
    Wang, Siliang
    Yin, Jianqin
    Zhang, Hong
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10097 - 10104
  • [9] Semi-supervised active learning image classification method based on Tri-Training algorithm
    Zhang, Yongjun
    Yan, Siyu
    [J]. PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 206 - 210
  • [10] Semi-supervised patent text classification method based on improved Tri-training algorithm
    Hu, Yun-Qing
    Qiu, Qing-Ying
    Yu, Xiu
    Wu, Jian-Wei
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (02): : 331 - 339