Joint Feature Selection and Classification for Positive Unlabelled Multi-Label Data Using Weighted Penalized Empirical Risk Minimization

被引:1
|
作者
TEISSEYRE, P. A. W. E. L. [1 ,2 ]
机构
[1] Polish Acad Sci, Inst Comp Sci, Jana Kazimierza 5, PL-01248 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, Koszykowa 75, PL-00062 Warsaw, Poland
关键词
positive and unlabelled data; multi-label classification; feature selection; empirical risk minimization;
D O I
10.34768/amcs-2022-0023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the positive-unlabelled multi-label scenario in which multiple target variables are not observed directly. Instead, we observe surrogate variables indicating whether or not the target variables are labelled. The presence of a label means that the corresponding variable is positive. The absence of the label means that the variable can be either positive or negative. We analyze embedded feature selection methods based on two weighted penalized empirical risk minimization frameworks. In the first approach, we introduce weights of observations. The idea is to assign larger weights to observations for which there is a consistency between the values of the true target variable and the corresponding surrogate variable. In the second approach, we consider a weighted empirical risk function which corresponds to the risk function for the true unobserved target variables. The weights in both the methods depend on the unknown propensity score functions, whose estimation is a challenging problem. We propose to use very simple bounds for the propensity score, which leads to relatively simple forms of weights. In the experiments we analyze the predictive power of the methods considered for different labelling schemes.
引用
收藏
页码:311 / 322
页数:12
相关论文
共 50 条
  • [1] Multi-task Joint Feature Selection for Multi-label Classification
    He Zhifen
    Yang Ming
    Liu Huidong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (02) : 281 - 287
  • [2] Multi-task Joint Feature Selection for Multi-label Classification
    HE Zhifen
    YANG Ming
    LIU Huidong
    [J]. Chinese Journal of Electronics, 2015, 24 (02) : 281 - 287
  • [3] Joint multi-label classification and label correlations with missing labels and feature selection
    He, Zhi-Fen
    Yang, Ming
    Gao, Yang
    Liu, Hui-Dong
    Yin, Yilong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 145 - 158
  • [4] Feature Selection for Multi-label Classification Using Neighborhood Preservation
    Cai, Zhiling
    Zhu, William
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2018, 5 (01) : 320 - 330
  • [5] Feature Selection for Multi-label Classification Using Neighborhood Preservation
    Zhiling Cai
    William Zhu
    [J]. IEEE/CAA Journal of Automatica Sinica, 2018, 5 (01) : 320 - 330
  • [6] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208
  • [7] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [8] An Empirical Comparison Of Feature Selection Methods In Problem Transformation Multi-label Classification
    Rodriguez, J. M.
    Godoy, D.
    Zunino, A.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (08) : 3784 - 3791
  • [9] Multi-Label Bioinformatics Data Classification With Ensemble Embedded Feature Selection
    Guo, Yumeng
    Chung, Fu-Lai
    Li, Guozheng
    Zhang, Lei
    [J]. IEEE ACCESS, 2019, 7 : 103863 - 103875
  • [10] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357