Combination of Information in Labeled and Unlabeled Data via Evidence Theory

被引:4
|
作者
Huang L. [1 ]
机构
[1] Northwestern Polytechnical University, School of Automation, Xi'an,710072, China
来源
关键词
Bayes methods; Belief functions; Data mining; Data models; Evidence theory; evidence theory; evidential reasoning; fuzzy c-mean clustering; information fusion; pattern classification; Reliability theory; semi-supervised learning; Training; Training data; two views co-training;
D O I
10.1109/TAI.2023.3316194
中图分类号
学科分类号
摘要
For classification with few labeled and massive unlabeled patterns, co-training, which uses information in labeled and unlabeled data to classify query patterns, is often employed to train classifiers in two distinct views. The classifiers teach each other by adding high-confidence unlabeled patterns to training dataset of the other view. Whereas, the direct adding often leads to some negative influence when retraining classifiers because some patterns with wrong predictions are added into training dataset. The wrong predictions must be considered for performance improvement. To this end, we present a method called Combination of Information in Labeled and Unlabeled (CILU) data based on evidence theory to effectively extract and fuse complementary knowledge in labeled and unlabeled data. In CILU, patterns are characterized by two distinct views, and the unlabeled patterns with high-confidence predictions are first added into the other view. We can train two classifiers by few labeled training data and high-confidence unlabeled patterns in each view. The classifiers are fused by evidence theory, and their weights which aim to reduce the harmful influence of wrong predictions are learnt by constructing an objection function on labeled data. There exist some complementary information between two distinct views, so the fused classifiers in two views are also combined. In order to extract more useful information in unlabeled data, the semi-supervised Fuzzy C-mean clustering paradigm is then employed to yield clustering results. For a query pattern, the classification results and clustering results obtained by combined classifiers and clustering partition are integrated here to make the final class decision. IEEE
引用
收藏
页码:1 / 13
页数:12
相关论文
共 50 条
  • [1] Learning from labeled and unlabeled data
    Kothari, R
    Jain, V
    [J]. PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 2803 - 2808
  • [2] Labeled and unlabeled data in text categorization
    Silva, C
    Ribeiro, B
    [J]. 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 2971 - 2976
  • [3] Feature extractions using labeled and unlabeled data
    Kuo, BC
    Shen, TW
    Chang, CH
    Hung, CC
    [J]. IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 1257 - 1260
  • [4] Combining labeled and unlabeled data with graph embedding
    Zhao, Haitao
    [J]. NEUROCOMPUTING, 2006, 69 (16-18) : 2385 - 2389
  • [5] Combining labeled and unlabeled data for spam classification
    Yang, Zhen
    Wang, Jian
    Xu, Weiran
    Guo, Jun
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 1476 - 1479
  • [6] Learning classification with both labeled and unlabeled data
    Vittaut, JN
    Amini, MR
    Gallinari, P
    [J]. MACHINE LEARNING: ECML 2002, 2002, 2430 : 468 - 479
  • [8] A multiclass boosting algorithm to labeled and unlabeled data
    Jafar Tanha
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 3647 - 3665
  • [9] The Effect of Labeled/Unlabeled Prior Information for Masseter Segmentation
    Tabar, Yousef Rezaei
    Ulusoy, Ilkay
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [10] COMBINE LABELED AND UNLABELED INFORMATION FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Du, Qian
    Han, Deok
    Younan, Nicolas H.
    [J]. 2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 2581 - 2584