Decision trees for hierarchical multi-label classification

Authors
Celine Vens
Jan Struyf
Leander Schietgat
Sašo Džeroski
Hendrik Blockeel
Affiliations
[1] Katholieke Universiteit Leuven, Department of Computer Science
[2] Jožef Stefan Institute, Department of Knowledge Technologies
Source
Machine Learning | 2008 / Vol. 73
Keywords
Hierarchical classification; Multi-label classification; Decision trees; Functional genomics; Precision-recall analysis
DOI
Not available
Abstract
Hierarchical multi-label classification (HMC) is a variant of classification where instances may belong to multiple classes at the same time and these classes are organized in a hierarchy. This article presents several approaches to the induction of decision trees for HMC, as well as an empirical study of their use in functional genomics. We compare learning a single HMC tree (which makes predictions for all classes together) to two approaches that learn a set of regular classification trees (one for each class). The first approach defines an independent single-label classification task for each class (SC). Obviously, the hierarchy introduces dependencies between the classes. While they are ignored by the first approach, they are exploited by the second approach, named hierarchical single-label classification (HSC). Depending on the application at hand, the hierarchy of classes can be such that each class has at most one parent (tree structure) or such that classes may have multiple parents (DAG structure). The latter case has not been considered before and we show how the HMC and HSC approaches can be modified to support this setting. We compare the three approaches on 24 yeast data sets using as classification schemes MIPS’s FunCat (tree structure) and the Gene Ontology (DAG structure). We show that HMC trees outperform HSC and SC trees along three dimensions: predictive accuracy, model size, and induction time. We conclude that HMC trees should definitely be considered in HMC tasks where interpretable models are desired.
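As a rough illustration of the setup the abstract describes, the sketch below shows how one multi-label data set can be turned into either a single all-classes target (the HMC view) or one binary target per class (the SC view), and how a tree-structured hierarchy differs from a DAG. It is not the authors' implementation: the toy hierarchy `PARENTS`, the class names, and the helper functions `is_tree`, `sc_targets`, and `hmc_targets` are all hypothetical, and no tree learner is included.

```python
from typing import Dict, List, Set

# Hypothetical toy hierarchy: each class maps to the list of its parents.
# An empty list marks a top-level class.
PARENTS: Dict[str, List[str]] = {
    "1": [],
    "2": [],
    "1/1": ["1"],
    "1/2": ["1"],
    "shared": ["1", "2"],  # two parents, so this hierarchy is a DAG
}


def is_tree(parents: Dict[str, List[str]]) -> bool:
    """True if every class has at most one parent (tree structure)."""
    return all(len(p) <= 1 for p in parents.values())


def sc_targets(labels: List[Set[str]], classes: List[str]) -> Dict[str, List[int]]:
    """SC view: one independent binary target vector per class."""
    return {c: [int(c in y) for y in labels] for c in classes}


def hmc_targets(labels: List[Set[str]], classes: List[str]) -> List[List[int]]:
    """HMC view: one binary vector per instance covering all classes."""
    return [[int(c in y) for c in classes] for y in labels]


# Two hypothetical instances with their class sets.
labels = [{"1", "1/1"}, {"1", "2", "shared"}]
classes = list(PARENTS)

per_class = sc_targets(labels, classes)   # input for one tree per class
joint = hmc_targets(labels, classes)      # input for a single tree
```

In this sketch the SC view yields as many learning tasks as there are classes, while the HMC view yields one task with a vector-valued target. The `PARENTS` map is the only place where the hierarchy appears; SC ignores it entirely.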
Pages: 185 - 214
Page count: 29
Related papers
50 in total
  • [1] Decision trees for hierarchical multi-label classification
    Vens, Celine
    Struyf, Jan
    Schietgat, Leander
    Dzeroski, Saso
    Blockeel, Hendrik
    [J]. MACHINE LEARNING, 2008, 73 (02) : 185 - 214
  • [2] Fuzzy Rough Decision Trees for Multi-label Classification
    Wang, Xiaoxue
    An, Shuang
    Shi, Hong
    Hu, Qinghua
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, RSFDGRC 2015, 2015, 9437 : 207 - 217
  • [3] Option Predictive Clustering Trees for Hierarchical Multi-label Classification
    Perdih, Tomaz Stepisnik
    Osojnik, Aljaz
    Dzeroski, Saso
    Kocev, Dragi
    [J]. DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 116 - 123
  • [4] Predictive Bi-clustering Trees for Hierarchical Multi-label Classification
    Santos, Bruna Z.
    Nakano, Felipe K.
    Cerri, Ricardo
    Vens, Celine
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT III, 2021, 12459 : 701 - 718
  • [5] Learning Hierarchical Multi-label Classification Trees from Network Data
    Stojanova, Daniela
    Ceci, Michelangelo
    Malerba, Donato
    Dzeroski, Saso
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 233 - 248
  • [6] Hierarchical Multi-Label Classification Networks
    Wehrmann, Jonatas
    Cerri, Ricardo
    Barros, Rodrigo C.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [7] ReliefF for Hierarchical Multi-label Classification
    Slavkov, Ivica
    Karcheska, Jana
    Kocev, Dragi
    Kalajdziski, Slobodan
    Dzeroski, Saso
    [J]. NEW FRONTIERS IN MINING COMPLEX PATTERNS, NFMCP 2013, 2014, 8399 : 148 - 161
  • [8] Semi-Supervised Predictive Clustering Trees for (Hierarchical) Multi-Label Classification
    Levatic, Jurica
    Ceci, Michelangelo
    Kocev, Dragi
    Dzeroski, Saso
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [9] The importance of the label hierarchy in hierarchical multi-label classification
    Jurica Levatić
    Dragi Kocev
    Sašo Džeroski
    [J]. Journal of Intelligent Information Systems, 2015, 45 : 247 - 271
  • [10] The importance of the label hierarchy in hierarchical multi-label classification
    Levatic, Jurica
    Kocev, Dragi
    Dzeroski, Saso
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (02) : 247 - 271