Semi-supervised attribute reduction for hybrid data

被引:0
|
作者
Zhaowen Li
Jiali He
Pei Wang
Ching-Feng Wen
机构
[1] Nanning University,College of Information Engineering
[2] Yulin Normal University,Center for Applied Mathematics of Guangxi, Key Laboratory of Complex System Optimization and Big Data Processing in Department of Guangxi Education
[3] Kaohsiung Medical University,Center for Fundamental Science, Research Center for Nonlinear Analysis and Optimization
[4] Kaohsiung Medical University Hospital,Department of Medical Research
关键词
Partially labeled hybrid data; p-HIS; Semi-supervised attribute reduction; Indiscernibility relation; Dependence function.;
D O I
暂无
中图分类号
学科分类号
摘要
Due to the high cost of labelling data, a lot of partially hybrid data are existed in many practical applications. Uncertainty measure (UM) can supply new viewpoints for analyzing data. They can help us in disclosing the substantive characteristics of data. Although there are some UMs to evaluate the uncertainty of hybrid data, they cannot be trivially transplanted into partially hybrid data. The existing studies often replace missing labels with pseudo-labels, but pseudo-labels are not real labels. When encountering high label error rates, work will be difficult to sustain. In view of the above situation, this paper studies four UMs for partially hybrid data and proposed semi-supervised attribute reduction algorithms. A decision information system with partially labeled hybrid data (p-HIS) is first divided into two decision information systems: one is the decision information system with labeled hybrid data (l-HIS) and the other is the decision information system with unlabeled hybrid data (u-HIS). Then, four degrees of importance on a attribute subset in a p-HIS are defined based on indistinguishable relation, distinguishable relation, dependence function, information entropy and information amount. We discuss the difference and contact among these UMs. They are the weighted sum of l-HIS and u-HIS determined by the missing rate and can be considered as UMs of a p-HIS. Next, numerical experiments and statistical tests on 12 datasets verify the effectiveness of these UMs. Moreover, an adaptive semi-supervised attribute reduction algorithm of a p-HIS is proposed based on the selected important degrees, which can automatically adapt to various missing rates. Finally, the results of experiments and statistical tests on 12 datasets show the proposed algorithm is statistically better than some stat-of-the-art algorithms according to classification accuracy.
引用
收藏
相关论文
共 50 条
  • [1] Semi-supervised attribute reduction for hybrid data
    Li, Zhaowen
    He, Jiali
    Wang, Pei
    Wen, Ching-Feng
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (03)
  • [2] Semi-supervised attribute reduction via attribute indiscernibility
    Jianhua Dai
    Weisi Wang
    Chucai Zhang
    Shaojun Qu
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 1445 - 1464
  • [3] Semi-supervised attribute reduction via attribute indiscernibility
    Dai, Jianhua
    Wang, Weisi
    Zhang, Chucai
    Qu, Shaojun
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (04) : 1445 - 1464
  • [4] Semi-supervised attribute reduction for interval data based on misclassification cost
    Jianhua Dai
    Qiong Liu
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 1739 - 1750
  • [5] Semi-supervised attribute reduction for interval data based on misclassification cost
    Dai, Jianhua
    Liu, Qiong
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (06) : 1739 - 1750
  • [6] Uncertainty measurement of partially labeled categorical data with application to semi-supervised attribute reduction
    Pei Wang
    Qinli Zhang
    Witold Pedrycz
    Zhaowen Li
    Ching-Feng Wen
    [J]. Artificial Intelligence Review, 2023, 56 : 14731 - 14764
  • [7] Uncertainty measurement of partially labeled categorical data with application to semi-supervised attribute reduction
    Wang, Pei
    Zhang, Qinli
    Pedrycz, Witold
    Li, Zhaowen
    Wen, Ching-Feng
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (12) : 14731 - 14764
  • [8] Semi-supervised attribute reduction for partially labeled categorical data based on predicted label
    Huang, Dan
    Zhang, Qinli
    Li, Zhaowen
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 154 : 242 - 261
  • [9] Semi-supervised attribute reduction based on label distribution and label irrelevance
    Dai, Jianhua
    Huang, Weiyi
    Wang, Weisi
    Zhang, Chucai
    [J]. INFORMATION FUSION, 2023, 100
  • [10] Semi-supervised Collective Classification in Multi-attribute Network Data
    Shaokai Wang
    Yunming Ye
    Xutao Li
    Xiaohui Huang
    Raymond Y. K. Lau
    [J]. Neural Processing Letters, 2017, 45 : 153 - 172