ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification

被引:68
|
作者
Wu, Qingyao [1 ]
Tan, Mingkui [1 ]
Song, Hengjie [1 ]
Chen, Jian [1 ]
Ng, Michael K. [2 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510641, Guangdong, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; label dependency; label transfer; tree classifier; ensemble methods;
D O I
10.1109/TKDE.2016.2581161
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification deals with the problem where each example is associated with multiple class labels. Since the labels are often dependent to other labels, exploiting label dependencies can significantly improve the multi-label classification performance. The label dependency in existing studies is often given as prior knowledge or learned from the labels only. However, in many real applications, such prior knowledge may not be available, or labeled information might be very limited. In this paper, we propose a new algorithm, called ML-FOREST, to learn an ensemble of hierarchical multi-label classifier trees to reveal the intrinsic label dependencies. In ML-FOREST, we construct a set of hierarchical trees, and develop a label transfer mechanism to identify the multiple relevant labels in a hierarchical way. In general, the relevant labels at higher levels of the trees capture more discriminable label concepts, and they will be transferred into lower level children nodes that are harder to discriminate. The relevant labels in the hierarchy are then aggregated to compute label dependency and make the final prediction. Our empirical study shows encouraging results of the proposed algorithm in comparison with the state-of-the-art multi-label classification algorithms under Friedman test and post-hoc Nemenyi test.
引用
收藏
页码:2665 / 2680
页数:16
相关论文
共 50 条
  • [41] Ensemble of classifier chains and decision templates for multi-label classification
    Rocha, Victor Freitas
    Varejao, Flavio Miguel
    Vieira Segatto, Marcelo Eduardo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (03) : 643 - 663
  • [42] GENERALIZED K-LABELSET ENSEMBLE FOR MULTI-LABEL CLASSIFICATION
    Lo, Hung-Yi
    Lin, Shou-De
    Wang, Hsin-Min
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2061 - 2064
  • [43] An Efficient Multi-Label Classification System Using Ensemble of Classifiers
    Chandran, Shilpa A.
    Panicker, Janu R.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 1133 - 1136
  • [44] Multi-Label Learning with Deep Forest
    Yang, Liang
    Wu, Xi-Zhu
    Jiang, Yuan
    Zhou, Zhi-Hua
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1634 - 1641
  • [45] Ensemble of classifier chains and decision templates for multi-label classification
    Victor Freitas Rocha
    Flávio Miguel Varejão
    Marcelo Eduardo Vieira Segatto
    Knowledge and Information Systems, 2022, 64 : 643 - 663
  • [46] Multilabel classification using heterogeneous ensemble of multi-label classifiers
    Tahir, Muhammad Atif
    Kittler, Josef
    Bouridane, Ahmed
    PATTERN RECOGNITION LETTERS, 2012, 33 (05) : 513 - 523
  • [47] Multi-Label ECG Signal Classification Based on Ensemble Classifier
    Sun, Zhanquan
    Wang, Chaoli
    Zhao, Yangyang
    Yan, Chao
    IEEE ACCESS, 2020, 8 : 117986 - 117996
  • [48] Active k-labelsets ensemble for multi-label classification
    Wang, Ran
    Kwong, Sam
    Wang, Xu
    Jia, Yuheng
    PATTERN RECOGNITION, 2021, 109
  • [49] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1079 - 1089
  • [50] An Ensemble Embedded Feature Selection Method for Multi-Label Clinical Text Classification
    Guo, Yumeng
    Chung, Fulai
    Li, Guozheng
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 823 - 826