AN EXTENSIVE EVALUATION OF DECISION TREE-BASED HIERARCHICAL MULTILABEL CLASSIFICATION METHODS AND PERFORMANCE MEASURES

被引:18
|
作者
Cerri, Ricardo [1 ]
Pappa, Gisele L. [2 ]
Carvalho, Andre Carlos P. L. F. [1 ]
Freitas, Alex A. [3 ]
机构
[1] Univ Sao Paulo, Dept Ciencias Comp, Campus Sao Carlos, BR-13560970 Sao Carlos, SP, Brazil
[2] Univ Fed Minas Gerais, Dept Ciencias Comp, Belo Horizonte, MG, Brazil
[3] Univ Kent, Sch Comp, Canterbury, Kent, England
基金
巴西圣保罗研究基金会;
关键词
hierarchical; multilabel; classification; performance measures; global and local approaches; GENE ONTOLOGY; TOOL;
D O I
10.1111/coin.12011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical multilabel classification is a complex classification problem where an instance can be assigned to more than one class simultaneously, and these classes are hierarchically organized with superclasses and subclasses, that is, an instance can be classified as belonging to more than one path in the hierarchical structure. This article experimentally analyses the behavior of different decision tree-based hierarchical multilabel classification methods based on the local and global classification approaches. The approaches are compared using distinct hierarchy-based and distance-based evaluation measures, when they are applied to a variation of real multilabel and hierarchical datasets' characteristics. Also, the different evaluation measures investigated are compared according to their degrees of consistency, discriminancy, and indifferency. As a result of the experimental analysis, we recommend the use of the global classification approach and suggest the use of the Hierarchical Precision and Hierarchical Recall evaluation measures.
引用
收藏
页码:1 / 46
页数:46
相关论文
共 50 条
  • [41] Decision trees for hierarchical multilabel classification: A case study in functional genomics
    Blockeel, Hendrik
    Schietgat, Leander
    Struyf, Jan
    Dzeroski, Saso
    Clare, Amanda
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 18 - 29
  • [42] Tree-based methods for tumor classification with gene expression microarray.
    Zhang, HP
    Yu, CY
    Singer, BH
    Xiong, MM
    AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (04) : 243 - 243
  • [43] Classification of repeated measurements data using tree-based ensemble methods
    Adler, Werner
    Potapov, Sergej
    Lausen, Berthold
    COMPUTATIONAL STATISTICS, 2011, 26 (02) : 355 - 369
  • [44] A Memetic Genetic Programming with Decision Tree-based Local Search for Classification Problems
    Wang, Pu
    Tang, Ke
    Tsang, Edward P. K.
    Yao, Xin
    2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 917 - 924
  • [45] Effectiveness of frequency domain parameters in decision tree-based classification of GMAW defects
    Rajan, Thomas P.
    Sundareswaran, K.
    Venkateswaran, P. R.
    INTERNATIONAL JOURNAL OF MANUFACTURING RESEARCH, 2023, 18 (03) : 251 - 265
  • [46] MIPART: A Partial Decision Tree-Based Method for Multiple-Instance Classification
    Balbal, Kadriye Filiz
    Applied Sciences (Switzerland), 2024, 14 (24):
  • [47] Suite of decision tree-based classification algorithms on cancer gene expression data
    Al Snousy, Mohmad Badr
    El-Deeb, Hesham Mohamed
    Badran, Khaled
    Al Khlil, Ibrahim Ali
    EGYPTIAN INFORMATICS JOURNAL, 2011, 12 (02) : 73 - 82
  • [48] An Advanced Decision Tree-Based Deep Neural Network in Nonlinear Data Classification
    Arifuzzaman, Mohammad
    Hasan, Md. Rakibul
    Toma, Tasnia Jahan
    Hassan, Samia Binta
    Paul, Anup Kumar
    TECHNOLOGIES, 2023, 11 (01)
  • [49] Tree-based classification of tabla strokes
    Deolekar, Subodh
    Abraham, Siby
    CURRENT SCIENCE, 2018, 115 (09): : 1724 - 1731
  • [50] Decision Tree-Based Approach for Defect Detection and Classification in Oil and Gas Pipelines
    Mohamed, Abduljalil
    Hamdi, Mohamed Salah
    Tahar, Sofiene
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2018, VOL 1, 2019, 880 : 490 - 504