Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach

被引:0
|
作者
Azad Naik
Huzefa Rangwala
机构
[1] Microsoft Corporation,
[2] George Mason University,undefined
关键词
Top-down hierarchical classification; Inconsistency; Error propagation; Flattening; Clustering; Rewiring;
D O I
暂无
中图分类号
学科分类号
摘要
Hierarchical Classification (HC) is a supervised learning problem where unlabeled instances are classified into a taxonomy of classes. Several methods that utilize the hierarchical structure have been developed to improve the HC performance. However, in most cases apriori defined hierarchical structure by domain experts is inconsistent; as a consequence performance improvement is not noticeable in comparison to flat classification methods. We propose a scalable data-driven filter based rewiring approach to modify an expert-defined hierarchy. Experimental comparisons of top-down hierarchical classification with our modified hierarchy, on a wide range of datasets shows classification performance improvement over the baseline hierarchy (i.e., defined by expert), clustered hierarchy and flattening based hierarchy modification approaches. In comparison to existing rewiring approaches, our developed method (rewHier) is computationally efficient, enabling it to scale to datasets with large numbers of classes, instances and features. We also show that our modified hierarchy leads to improved classification performance for classes with few training samples in comparison to flat and state-of-the-art hierarchical classification approaches. Source Code: https://cs.gmu.edu/~mlbio/TaxMod/
引用
收藏
页码:141 / 164
页数:23
相关论文
共 50 条
  • [41] Evaluation of large-scale cycling environment by using the trajectory data of dockless shared bicycles: A data-driven approach
    Ni, Ying
    Wang, Shihan
    Chen, Jiaqi
    Feng, Bufan
    Yu, Rongjie
    Cai, Yilin
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, : 1943 - 1961
  • [42] A data-driven layout optimization framework of large-scale wind farms based on machine learning
    Yang, Kun
    Deng, Xiaowei
    Ti, Zilong
    Yang, Shanghui
    Huang, Senbin
    Wang, Yuhang
    RENEWABLE ENERGY, 2023, 218
  • [43] Data-driven modelling of energy demand response behaviour based on a large-scale residential trial
    Antonopoulos, Ioannis
    Robu, Valentin
    Couraud, Benoit
    Flynn, David
    ENERGY AND AI, 2021, 4
  • [44] A data-driven distributed fault diagnosis scheme for large-scale systems based on correlation analysis
    Li, Zhennan
    Li, Linlin
    Ding, Steven X.
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (02): : 201 - 212
  • [45] Data-driven causality digraph modeling of large-scale complex system based on transfer entropy
    Faghraoui, Ahmed
    Kabadi, Mohamed Ghassane
    Sauter, Dominique
    Boukhobza, Taha
    Aubrun, Christophe
    2014 IEEE CONFERENCE ON CONTROL APPLICATIONS (CCA), 2014, : 705 - 710
  • [46] A logical approach to data-driven classification
    Osswald, R
    Petersen, W
    KI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2821 : 267 - 281
  • [47] Large-scale industrial energy systems optimization under uncertainty: A data-driven robust optimization approach
    Shen, Feifei
    Zhao, Liang
    Du, Wenli
    Zhong, Weimin
    Qian, Feng
    APPLIED ENERGY, 2020, 259 (259)
  • [48] A data-driven approach for collaborative optimization of large-scale electric vehicles considering energy consumption uncertainty
    Cheng, Xingxing
    Zhang, Rongquan
    Bu, Siqi
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 221
  • [49] A data-driven approach to anomaly detection and vulnerability dynamic analysis for large-scale integrated energy systems
    Zhang, Li
    Su, Huai
    Zio, Enrico
    Zhang, Zhien
    Chi, Lixun
    Fan, Lin
    Zhou, Jing
    Zhang, Jinjun
    ENERGY CONVERSION AND MANAGEMENT, 2021, 234
  • [50] JS']JSweep: A Patch-centric Data-driven Approach for Parallel Sweeps on Large-scale Meshes
    Yan, Jie
    Yang, Zhang
    Zhang, Aiqing
    Mo, Zeyao
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 776 - 785