Decision trees for hierarchical multi-label classification

被引:0
|
作者
Celine Vens
Jan Struyf
Leander Schietgat
Sašo Džeroski
Hendrik Blockeel
机构
[1] Katholieke Universiteit Leuven,Department of Computer Science
[2] Jožef Stefan Institute,Department of Knowledge Technologies
来源
Machine Learning | 2008年 / 73卷
关键词
Hierarchical classification; Multi-label classification; Decision trees; Functional genomics; Precision-recall analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Hierarchical multi-label classification (HMC) is a variant of classification where instances may belong to multiple classes at the same time and these classes are organized in a hierarchy. This article presents several approaches to the induction of decision trees for HMC, as well as an empirical study of their use in functional genomics. We compare learning a single HMC tree (which makes predictions for all classes together) to two approaches that learn a set of regular classification trees (one for each class). The first approach defines an independent single-label classification task for each class (SC). Obviously, the hierarchy introduces dependencies between the classes. While they are ignored by the first approach, they are exploited by the second approach, named hierarchical single-label classification (HSC). Depending on the application at hand, the hierarchy of classes can be such that each class has at most one parent (tree structure) or such that classes may have multiple parents (DAG structure). The latter case has not been considered before and we show how the HMC and HSC approaches can be modified to support this setting. We compare the three approaches on 24 yeast data sets using as classification schemes MIPS’s FunCat (tree structure) and the Gene Ontology (DAG structure). We show that HMC trees outperform HSC and SC trees along three dimensions: predictive accuracy, model size, and induction time. We conclude that HMC trees should definitely be considered in HMC tasks where interpretable models are desired.
引用
收藏
页码:185 / 214
页数:29
相关论文
共 50 条
  • [41] An Interactive Fusion Model for Hierarchical Multi-label Text Classification
    Zhao, Xiuhao
    Li, Zhao
    Zhang, Xianming
    Wang, Jibin
    Chen, Tong
    Ju, Zhengyu
    Wang, Canjun
    Zhang, Chao
    Zhan, Yiming
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 168 - 178
  • [42] Inducing Hierarchical Multi-label Classification rules with Genetic Algorithms
    Cerri, Ricardo
    Basgalupp, Marcio P.
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    [J]. APPLIED SOFT COMPUTING, 2019, 77 : 584 - 604
  • [43] Hierarchical multi-label classification using local neural networks
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (01) : 39 - 56
  • [44] Hierarchical Multi-Label Classification with Partial Labels and Unknown Hierarchy
    Jo, Suhyeon
    Shin, DongHyeok
    Na, Byeonghu
    Jang, JoonHo
    Moon, Il-Chul
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1025 - 1034
  • [45] Hierarchical text classification with multi-label contrastive learning and KNN
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    He, Yueshun
    Tan, Hai
    He, Yanxiang
    [J]. NEUROCOMPUTING, 2024, 577
  • [46] Ontologue: Declarative Benchmark Construction for Hierarchical Multi-Label Classification
    Yang, Sean T.
    Herman, Bernease
    Howe, Bill
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [47] Label Expansion for Multi-Label Classification
    Rivolli, Adriano
    Soares, Carlos
    de Carvalho, Andre C. P. L. F.
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 414 - 419
  • [48] Hierarchical multi-label classification based on LSTM network and Bayesian decision theory for LncRNA function prediction
    Shou Feng
    Huiying Li
    Jiaqing Qiao
    [J]. Scientific Reports, 12
  • [49] Hierarchical multi-label classification based on LSTM network and Bayesian decision theory for LncRNA function prediction
    Feng, Shou
    Li, Huiying
    Qiao, Jiaqing
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [50] Learning multi-label alternating decision trees from texts and data
    De Comité, F
    Gilleron, R
    Tommasi, M
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2734 : 35 - 49