Rough set-based feature selection for weakly labeled data

被引:29
|
作者
Campagner, Andrea [1 ]
Ciucci, Davide [1 ]
Huellermeier, Eyke [2 ]
机构
[1] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
[2] Univ Munich LMU, Inst Informat, Munich, Germany
关键词
Superset Learning; Rough Sets; Feature Selection; Evidence Theory; Entropy; DEMPSTER-SHAFER THEORY; TOTAL UNCERTAINTY; BELIEF FUNCTIONS; CLASSIFICATION; ENTROPY; INFORMATION; RULE;
D O I
10.1016/j.ijar.2021.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised learning is an important branch of machine learning (ML), which requires a complete annotation (labeling) of the involved training data. This assumption is relaxed in the settings of weakly supervised learning, where labels are allowed to be imprecise or partial. In this article, we study the setting of superset learning, in which instances are assumed to be labeled with a set of possible annotations containing the correct one. We tackle the problem of learning from such data in the context of rough set theory (RST). More specifically, we consider the problem of RST-based feature reduction as a suitable means for data disambiguation, i.e., for the purpose of figuring out the most plausible precise instantiation of the imprecise training data. To this end, we define appropriate generalizations of decision tables and reducts, using tools from generalized information theory and belief function theory. Moreover, we analyze the computational complexity and theoretical properties of the associated computational problems. Finally, we present results of a series of experiments, in which we analyze the proposed concepts empirically and compare our methods with a state-of-the-art dimensionality reduction algorithm, reporting a statistically significant improvement in predictive accuracy. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:150 / 167
页数:18
相关论文
共 50 条
  • [31] PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task
    Kumar, S. Udhaya
    Inbarani, H. Hannah
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (11): : 3239 - 3258
  • [32] A fuzzy rough set-based undersampling approach for imbalanced data
    Zhang, Xiao
    He, Zhaoqian
    Yang, Yanyan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2799 - 2810
  • [33] PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task
    S. Udhaya Kumar
    H. Hannah Inbarani
    [J]. Neural Computing and Applications, 2017, 28 : 3239 - 3258
  • [34] Rough set-based intelligent agent grid data management
    Chen, Jia
    Liu, Di
    [J]. 2007 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS; VOL 2: SIGNAL PROCESSING, COMPUTATIONAL INTELLIGENCE, CIRCUITS AND SYSTEMS, 2007, : 937 - +
  • [35] ROUGH SET-BASED DESIGN RULE SELECTION FOR COLLABORATIVE ASSEMBLY DESIGN
    Kim, Kyoung-Yun
    Choi, Keunho
    [J]. DETC 2008: PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, VOL 1, PTS A AND B: 34TH DESIGN AUTOMATION CONFERENCE, 2009, : 53 - 59
  • [36] Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection
    Slezak, Dominik
    [J]. INFORMATION SCIENCES, 2009, 179 (03) : 197 - 209
  • [37] Intelligent temporal classification and fuzzy rough set-based feature selection algorithm for intrusion detection system in WSNs
    Selvakumar, K.
    Karuppiah, Marimuthu
    SaiRamesh, L.
    Islam, S. K. Hafizul
    Hassan, Mohammad Mehedi
    Fortino, Giancarlo
    Choo, Kim-Kwang Raymond
    [J]. INFORMATION SCIENCES, 2019, 497 : 77 - 90
  • [38] Feature selection based on rough set and information entropy
    Han, JC
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 153 - 158
  • [39] A Rough Set Based Hybrid Method to Feature Selection
    Ming, He
    [J]. KAM: 2008 INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING, PROCEEDINGS, 2008, : 585 - 588
  • [40] A rough set-based CBR approach for feature and document reduction in text categorization
    Li, Y
    Shiu, SCK
    Pal, SK
    Liu, JNK
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2438 - 2443