Severity of error in hierarchical datasets

被引:0
|
作者
Srivastava, Satwik [1 ]
Mishra, Deepak [2 ]
机构
[1] Indian Inst Technol Jodhpur, Dept Math, Jodhpur, India
[2] Indian Inst Technol Jodhpur, Dept Comp Sci & Engn, Jodhpur, India
关键词
D O I
10.1038/s41598-023-49185-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classification tasks today, especially for the medical domain, use datasets which are often hierarchical. These tasks are approached using methods that consider the class taxonomy for predicting a label. The classifiers are gradually becoming increasingly accurate over the complex datasets. While increasing accuracy is a good way to judge a model, in high-risk applications, it needs to be ensured that even if the model makes a mistake, it does not bear a severe consequence. This work explores the concept of severity of an error and extends it to the medical domain. Further, it aims to point out that accuracy or AUROC alone are not sufficient metrics to decide the performance of a model in a setting where a misclassification will incur a severe cost. Various approaches to reduce severity for classification models are compared and evaluated in this work, which indicate that while many of them might be suited for a traditional image classification setting, there is a need for techniques tailored toward tasks and settings of medical domain to push artificial intelligence in healthcare to a deployable state.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Effective data summarization for hierarchical clustering in large datasets
    Patra, Bidyut Kr.
    Nandi, Sukumar
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 42 (01) : 1 - 20
  • [22] Hierarchical model-based clustering for large datasets
    Posse, C
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2001, 10 (03) : 464 - 486
  • [23] Do Datapoints Argue?: Argumentation for Hierarchical Agreement in Datasets
    Bahuguna, Ayush
    Haydar, Sajjad
    Brannstrom, Andreas
    Nieves, Juan Carlos
    [J]. ARTIFICIAL INTELLIGENCE-ECAI 2023 INTERNATIONAL WORKSHOPS, PT 2, XAI3, TACTIFUL, XI-ML, SEDAMI, RAAIT, AI4S, HYDRA, AI4AI, 2023, 2024, 1948 : 291 - 303
  • [24] Efficient Hierarchical Clustering of Large High Dimensional Datasets
    Gilpin, Sean
    Qian, Buyue
    Davidson, Ian
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1371 - 1380
  • [25] Topic-Constrained Hierarchical Clustering for Document Datasets
    Zhao, Ying
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 181 - 192
  • [26] Multilingual and hierarchical classification of large datasets of scientific publications
    Protasiewicz, Jaroslaw
    Stanislawek, Tomasz
    Dadas, Slawomir
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1670 - 1675
  • [27] Hierarchical feature extraction for compact representation and classification of datasets
    Schubert, Markus
    Kohlmorgen, Jens
    [J]. NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 556 - 565
  • [28] Likelihood approximation with hierarchical matrices for large spatial datasets
    Litvinenko, Alexander
    Sun, Ying
    Genton, Marc G.
    Keyes, David E.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 137 : 115 - 132
  • [29] DHC: A Distributed Hierarchical Clustering Algorithm for Large Datasets
    Zhang, Wei
    Zhang, Gongxuan
    Chen, Xiaohui
    Liu, Yueqi
    Zhou, Xiumin
    Zhou, Junlong
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (04)
  • [30] Hierarchical Aggregation Approach for Distributed clustering of spatial datasets
    Bendechache, Malika
    Le-Khac, Nhien-An
    Kechadi, M-Tahar
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 1098 - 1103