Combination of Information in Labeled and Unlabeled Data via Evidence Theory

Cited by: 4
Authors
Huang L. [1]
Affiliations
[1] Northwestern Polytechnical University, School of Automation, Xi'an
Keywords
Belief functions; evidence theory (ET); evidential reasoning; fuzzy c-means (FCM) clustering; information fusion; pattern classification; semisupervised learning (SSL); two-view co-training
DOI
10.1109/TAI.2023.3316194
Abstract
For classification with few labeled and many unlabeled patterns, co-training, which uses information in both labeled and unlabeled data to classify query patterns, is often employed to train classifiers in two distinct views. The classifiers teach each other by adding high-confidence unlabeled patterns to the training dataset of the other view. However, adding these patterns directly often degrades the retrained classifiers, because some patterns with wrong predictions enter the training dataset. These wrong predictions must be accounted for to improve performance. To this end, we present a method called Combination of Information in Labeled and Unlabeled (CILU) data, based on evidence theory, to effectively extract and fuse complementary knowledge in labeled and unlabeled data. In CILU, patterns are characterized by two distinct views, and the unlabeled patterns with high-confidence predictions are first added to the other view. In each view, we can train two classifiers, using the few labeled training data and the high-confidence unlabeled patterns. The classifiers are fused by evidence theory, and their weights, which aim to reduce the harmful influence of wrong predictions, are learned by constructing an objective function on the labeled data. Because the two distinct views carry complementary information, the fused classifiers of the two views are also combined. To extract more useful information from the unlabeled data, a semi-supervised fuzzy c-means clustering paradigm is also employed to yield clustering results. For a query pattern, the classification results from the combined classifiers and the clustering results from the clustering partition are integrated to make the final class decision. © 2023 IEEE.
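The weighted fusion step described in the abstract can be illustrated with a minimal sketch. This is not the CILU implementation: it assumes each classifier outputs class probabilities, that a learned weight acts as a Shafer discounting factor, and that the discounted outputs are combined with Dempster's rule. The function names, weights, and probabilities below are hypothetical and chosen only for illustration.

```python
import numpy as np

def discount(probs, weight):
    """Shafer discounting (assumed weighting scheme, not taken from the paper).

    Scales the mass on each single class by `weight` and moves the
    remaining mass to total ignorance (the whole set of classes).
    Returns (singleton_masses, ignorance_mass).
    """
    singletons = weight * np.asarray(probs, dtype=float)
    ignorance = 1.0 - singletons.sum()
    return singletons, ignorance

def dempster_combine(m1, m2):
    """Dempster's rule for masses on single classes plus total ignorance."""
    s1, o1 = m1
    s2, o2 = m2
    # A class keeps the mass where both sources agree on it, or where
    # one source supports it and the other is ignorant.
    singletons = s1 * s2 + s1 * o2 + o1 * s2
    ignorance = o1 * o2
    # The remaining mass is conflict (sources backing different classes);
    # dividing by (1 - conflict) renormalizes the result.
    total = singletons.sum() + ignorance
    if total <= 0.0:
        raise ValueError("sources are in total conflict")
    return singletons / total, ignorance / total

# Hypothetical outputs of two classifiers for one query pattern.
# The second classifier is trusted less (weight 0.6 versus 0.9).
m_a = discount([0.7, 0.2, 0.1], weight=0.9)
m_b = discount([0.4, 0.5, 0.1], weight=0.6)

fused_singletons, fused_ignorance = dempster_combine(m_a, m_b)
predicted_class = int(np.argmax(fused_singletons))
print(fused_singletons, fused_ignorance, predicted_class)
```

A lower weight pushes more of a classifier's mass onto ignorance, so a classifier that is often wrong has less influence on the fused decision. How the weights are actually learned from labeled data is specified in the paper, not in this sketch.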
Pages: 2179 - 2192
Page count: 13
Related papers
50 in total
  • [41] Efficient heuristics for learning Bayesian network from labeled and unlabeled data
    Duan, Zhiyi
    Wang, Limin
    Sun, Minghui
    INTELLIGENT DATA ANALYSIS, 2020, 24 (02) : 385 - 408
  • [42] Learning Instance Weighted Naive Bayes from labeled and unlabeled data
    Liangxiao Jiang
    Journal of Intelligent Information Systems, 2012, 38 : 257 - 268
  • [43] Local Adaptive Projection Framework for Feature Selection of Labeled and Unlabeled Data
    Chen, Xiaojun
    Yuan, Guowen
    Wang, Wenting
    Nie, Feiping
    Chang, Xiaojun
    Huang, Joshua Zhexue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 6362 - 6373
  • [44] Graph-based boosting algorithm to learn labeled and unlabeled data
    Liu, Zheng
    Jin, Wei
    Mu, Ying
    PATTERN RECOGNITION, 2020, 106
  • [45] Discriminative clustering with representation learning with any ratio of labeled to unlabeled data
    Corinne Jones
    Vincent Roulet
    Zaid Harchaoui
    Statistics and Computing, 2022, 32
  • [46] Combining labeled and unlabeled data for text classification with a large number of categories
    Ghani, R
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 597 - 598
  • [47] Learning from labeled and unlabeled data using a minimal number of queries
    Kothari, R
    Jain, V
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (06): : 1496 - 1505
  • [48] Combine labeled and unlabeled data for immune detector training with label propagation
    Chen Wen
    Wang Changzhi
    KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [49] Learning bayesian multinets from labeled and unlabeled data for knowledge representation
    Pang, Meng
    Wang, Limin
    Li, Qilong
    Lu, Guo
    Li, Kuo
    INTELLIGENT DATA ANALYSIS, 2023, 27 (06) : 1699 - 1723