Rough set-based feature selection for weakly labeled data

被引:29
|
作者
Campagner, Andrea [1 ]
Ciucci, Davide [1 ]
Huellermeier, Eyke [2 ]
机构
[1] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
[2] Univ Munich LMU, Inst Informat, Munich, Germany
关键词
Superset Learning; Rough Sets; Feature Selection; Evidence Theory; Entropy; DEMPSTER-SHAFER THEORY; TOTAL UNCERTAINTY; BELIEF FUNCTIONS; CLASSIFICATION; ENTROPY; INFORMATION; RULE;
D O I
10.1016/j.ijar.2021.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised learning is an important branch of machine learning (ML), which requires a complete annotation (labeling) of the involved training data. This assumption is relaxed in the settings of weakly supervised learning, where labels are allowed to be imprecise or partial. In this article, we study the setting of superset learning, in which instances are assumed to be labeled with a set of possible annotations containing the correct one. We tackle the problem of learning from such data in the context of rough set theory (RST). More specifically, we consider the problem of RST-based feature reduction as a suitable means for data disambiguation, i.e., for the purpose of figuring out the most plausible precise instantiation of the imprecise training data. To this end, we define appropriate generalizations of decision tables and reducts, using tools from generalized information theory and belief function theory. Moreover, we analyze the computational complexity and theoretical properties of the associated computational problems. Finally, we present results of a series of experiments, in which we analyze the proposed concepts empirically and compare our methods with a state-of-the-art dimensionality reduction algorithm, reporting a statistically significant improvement in predictive accuracy. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:150 / 167
页数:18
相关论文
共 50 条
  • [1] Rough set-based feature selection method
    Zhan, YM
    Zeng, XY
    Sun, JC
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2005, 15 (03) : 280 - 284
  • [2] Rough set-based feature selection method
    ZHAN Yanmei
    [J]. Progress in Natural Science:Materials International, 2005, (03) : 88 - 92
  • [3] A novel rough set-based feature selection method
    Xu, Yan
    Li, Jintao
    Wang, Bin
    Ding, Fan
    Sun, Chunming
    Wang, Xiaoleng
    [J]. RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 226 - 231
  • [4] Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
    Qasem A. Al-Radaideh
    Ghufran Y. Al-Qudah
    [J]. Cognitive Computation, 2017, 9 : 436 - 445
  • [5] Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
    Al-Radaideh, Qasem A.
    Al-Qudah, Ghufran Y.
    [J]. COGNITIVE COMPUTATION, 2017, 9 (04) : 436 - 445
  • [6] Rough set-based approach to feature selection in customer relationship management
    Tseng, Tzu-Liang
    Huang, Chun-Che
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2007, 35 (04): : 365 - 383
  • [7] A noise resistant dependency measure for rough set-based feature selection
    Javidi, Mohammad Masoud
    Eskandari, Sadegh
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (03) : 1613 - 1626
  • [8] RSFD: A rough set-based feature discretization method for meteorological data
    Zeng, Lirong
    Chen, Qiong
    Huang, Mengxing
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [9] Covering Rough Set-based Three-way Decision Feature Selection
    Ren, Mengyuan
    Qu, Yanpeng
    Deng, Ansheng
    [J]. PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 782 - 787
  • [10] Covering rough set-based incremental feature selection for mixed decision system
    Yang, Yanyan
    Chen, Degang
    Zhang, Xiao
    Ji, Zhenyan
    [J]. SOFT COMPUTING, 2022, 26 (06) : 2651 - 2669