Feature weighting to tackle label dependencies in multi-label stacking nearest neighbor

被引:1
|
作者
Rastin, Niloofar [1 ]
Jahromi, Mansoor Zolghadri [1 ]
Taheri, Mohammad [1 ]
机构
[1] Shiraz Univ, Sch Elect & Comp Engn, Shiraz, Iran
关键词
Multi-label classification; Stacking; Meta binary relevance; Label correlations; Nearest neighbor; Feature weighting; CLASSIFICATION; COST;
D O I
10.1007/s10489-020-02073-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-label learning, each instance is associated with a subset of predefined labels. One common approach for multi-label classification has been proposed in Godbole and Sarawagi (2004) based on stacking which is called as Meta Binary Relevance (MBR). It uses two layers of binary models and feeds the outputs of the first layer to all binary models of the second layer. Hence, initial predicted class labels (in the first layer) are attached to the original features to have a new prediction of the classes in the second layer. To predict a specific label in the second layer, irrelevant labels are also used as the noisy features. This is why; Nearest Neighbor (NN) as a sensitive classifier to noisy features had been not, up to now, a proper base classifier in stacking method and all of its merits including simplicity, interpretability, global stability to noisy labels and good performance, are lost. As the first contribution, a popular feature weighting in NN classification is used here to solve uncorrelated labels problem. It tunes a parametric distance function by gradient descent to minimize the classification error on training data. However, it is known that some other objectives including F-measure are more suitable than classification error on learning imbalanced data. The second contribution of this paper is extending this method in order to improve F-measure. In our experimental study, the proposed method has been compared with and outperforms state-of-the-art multi-label classifiers in the literature.
引用
收藏
页码:5200 / 5218
页数:19
相关论文
共 50 条
  • [21] Partial Multi-label Learning with Label and Feature Collaboration
    Yu, Tingting
    Yu, Guoxian
    Wang, Jun
    Guo, Maozu
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 621 - 637
  • [22] Independent Feature and Label Components for Multi-label Classification
    Zhong, Yongjian
    Xu, Chang
    Du, Bo
    Zhang, Lefei
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 827 - 836
  • [23] Multi-label feature selection considering label supplementation
    Zhang, Ping
    Liu, Guixia
    Gao, Wanfu
    Song, Jiazhi
    PATTERN RECOGNITION, 2021, 120 (120)
  • [24] Exploiting Label Dependencies for Multi-Label Document Classification Using Transformers
    Fallah, Haytame
    Bruno, Emmanuel
    Bellot, Patrice
    Murisasco, Elisabeth
    PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,
  • [25] A k-Nearest Neighbor Based Multi-Instance Multi-Label Learning Algorithm
    Zhang, Min-Ling
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 207 - 212
  • [26] A multi-label classification on topics of Indonesian news using K-Nearest Neighbor
    Isnaini, Nikmah
    Adiwijaya
    Mubarok, Mohamad Syahrul
    Abu Bakar, Muhammad Yuslan
    2ND INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE, 2019, 1192
  • [27] Semi-supervised multi-label image classification based on nearest neighbor editing
    Wei, Zhihua
    Wang, Hanli
    Zhao, Rui
    NEUROCOMPUTING, 2013, 119 : 462 - 468
  • [28] Contrastive Learning-Enhanced Nearest Neighbor Mechanism for Multi-Label Text Classification
    Su, Xi'ao
    Wang, Ran
    Dai, Xinyu
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 672 - 679
  • [29] Multi-label feature selection based on label correlations and feature redundancy
    Fan, Yuling
    Chen, Baihua
    Huang, Weiqin
    Liu, Jinghua
    Weng, Wei
    Lan, Weiyao
    KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [30] Distributed nearest neighbor classification for large-scale multi-label data on spark
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 66 - 82