Semi-supervised attribute reduction for hybrid data

被引:0
|
作者
Zhaowen Li
Jiali He
Pei Wang
Ching-Feng Wen
机构
[1] Nanning University,College of Information Engineering
[2] Yulin Normal University,Center for Applied Mathematics of Guangxi, Key Laboratory of Complex System Optimization and Big Data Processing in Department of Guangxi Education
[3] Kaohsiung Medical University,Center for Fundamental Science, Research Center for Nonlinear Analysis and Optimization
[4] Kaohsiung Medical University Hospital,Department of Medical Research
关键词
Partially labeled hybrid data; p-HIS; Semi-supervised attribute reduction; Indiscernibility relation; Dependence function.;
D O I
暂无
中图分类号
学科分类号
摘要
Due to the high cost of labelling data, a lot of partially hybrid data are existed in many practical applications. Uncertainty measure (UM) can supply new viewpoints for analyzing data. They can help us in disclosing the substantive characteristics of data. Although there are some UMs to evaluate the uncertainty of hybrid data, they cannot be trivially transplanted into partially hybrid data. The existing studies often replace missing labels with pseudo-labels, but pseudo-labels are not real labels. When encountering high label error rates, work will be difficult to sustain. In view of the above situation, this paper studies four UMs for partially hybrid data and proposed semi-supervised attribute reduction algorithms. A decision information system with partially labeled hybrid data (p-HIS) is first divided into two decision information systems: one is the decision information system with labeled hybrid data (l-HIS) and the other is the decision information system with unlabeled hybrid data (u-HIS). Then, four degrees of importance on a attribute subset in a p-HIS are defined based on indistinguishable relation, distinguishable relation, dependence function, information entropy and information amount. We discuss the difference and contact among these UMs. They are the weighted sum of l-HIS and u-HIS determined by the missing rate and can be considered as UMs of a p-HIS. Next, numerical experiments and statistical tests on 12 datasets verify the effectiveness of these UMs. Moreover, an adaptive semi-supervised attribute reduction algorithm of a p-HIS is proposed based on the selected important degrees, which can automatically adapt to various missing rates. Finally, the results of experiments and statistical tests on 12 datasets show the proposed algorithm is statistically better than some stat-of-the-art algorithms according to classification accuracy.
引用
收藏
相关论文
共 50 条
  • [41] Information gain-based semi-supervised feature selection for hybrid data
    Wenhao Shu
    Zhenchao Yan
    Jianhui Yu
    Wenbin Qian
    [J]. Applied Intelligence, 2023, 53 : 7310 - 7325
  • [42] Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
    Yan, Su
    Bouaziz, Sofien
    Lee, Dongwon
    Barlow, Jesse
    [J]. NEUROCOMPUTING, 2012, 76 (01) : 114 - 124
  • [43] A Semi-Supervised Weighted Clustering Framework Facing to Hybrid Attributes Data Streams
    Chen, Xinquan
    [J]. 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5988 - 5993
  • [44] Semi-Supervised Learning with Data Augmentation for Tabular Data
    Fang, Junpeng
    Tang, Caizhi
    Cui, Qing
    Zhu, Feng
    Li, Longfei
    Zhou, Jun
    Zhu, Wei
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3928 - 3932
  • [45] Coupled dimensionality reduction and classification for supervised and semi-supervised multilabel learning
    Goenen, Mehmet
    [J]. PATTERN RECOGNITION LETTERS, 2014, 38 : 132 - 141
  • [46] GENERATE AND ADJUST: A NOVEL FRAMEWORK FOR SEMI-SUPERVISED PEDESTRIAN ATTRIBUTE RECOGNITION
    Shan, Xuebo
    Peng, Peixi
    Zhai, Yunpeng
    Zhang, Chong
    Huang, Tiejun
    Tian, Yonghong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [47] Semi-Supervised Bayesian Attribute Learning for Person Re-Identification
    Liu, Wenhe
    Chang, Xiaojun
    Chen, Ling
    Yang, Yi
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7162 - 7169
  • [48] Semi-supervised Person Re-identification by Attribute Similarity Guidance
    Hong, Peixian
    Wu, Ancong
    Zheng, Wei-Shi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6471 - 6477
  • [49] Incremental semi-supervised learning on streaming data
    Li, Yanchao
    Wang, Yongli
    Liu, Qi
    Bi, Cheng
    Jiang, Xiaohui
    Sun, Shurong
    [J]. PATTERN RECOGNITION, 2019, 88 : 383 - 396
  • [50] A semi-supervised clustering algorithm for data exploration
    Bouchachia, A
    Pedrycz, W
    [J]. FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 328 - 337