Boosting Long-Tail Data Classification with Sparse Prototypical Networks

被引:0
|
作者
Figueroa, Alexei [1 ]
Papaioannou, Jens-Michalis [1 ,6 ]
Fallon, Conor [1 ]
Bekiaridou, Alexandra [2 ,3 ]
Bressem, Keno [4 ,5 ]
Zanos, Stavros [2 ,3 ]
Gers, Felix [1 ]
Nejdl, Wolfgang [6 ]
Loeser, Alexander [1 ]
机构
[1] Berliner Hsch Tech, DATEXIS, Berlin, Germany
[2] Elmezzi Grad Sch Mol Med, Manhasset, NY USA
[3] Northwell Hlth, Feinstein Inst Med Res, Manhasset, NY USA
[4] Tech Univ Munich, Dept Radiol, Klinikum Rechts Isar, Munich, Germany
[5] German Heart Ctr Munich, Dept Radiol & Nucl Med, Munich, Germany
[6] Leibniz Univ Hannover, L3S, Hannover, NH, Germany
关键词
Prototypical Networks; Sparsity; Long-Tail; NLP;
D O I
10.1007/978-3-031-70368-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clinical Decision Support Systems (CDSS) have become ubiquitous in healthcare facilities, leveraging the increasing presence of Electronic Health Records (EHR). Predicting clinical outcomes from clinical text, such as identifying diagnoses based on the admission state of patients, is among the core tasks that a CDSS must address. The state-of-the-art for this task has been set by transformer encoder models, recently superseded by encoders enhanced with a prototypical network. This task remains a significant challenge due to the substantial imbalance of the outcome labels, which is characterized by a long-tailed distribution where the majority of diagnoses are under-represented. Motivated by recent biologically inspired findings in deep learning, we propose S-Proto, a novel, efficient, and sparse prototypical layer. Our method achieves state-of-the-art performance in outcome diagnosis prediction, without compromising on the explainability characteristics of prototypical encoders. Quantitative results demonstrate that our approach is robust to the challenges presented by clinical notes, and transfers successfully to a second, unseen dataset. Qualitative evaluation with medical doctors shows that S-Proto is capable of disaggregating the representations of a disease that manifests differently in patient cohorts.
引用
收藏
页码:434 / 449
页数:16
相关论文
共 50 条
  • [1] Long-tail Effect on ECG Classification
    Thai, N. H.
    Nghia, N. T.
    Binh, D. V.
    Hai, N. T.
    Hung, N. M.
    [J]. 2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 34 - 38
  • [2] Identification and characterization of information-networks in long-tail data collections
    Elag, Mostafa M.
    Kumar, Praveen
    Marini, Luigi
    Myers, James D.
    Hedstrom, Margaret
    Plale, Beth A.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2017, 94 : 100 - 111
  • [3] Retrieval Augmented Classification for Long-Tail Visual Recognition
    Long, Alexander
    Yin, Wei
    Ajanthan, Thalaiyasingam
    Vu Nguyen
    Purkait, Pulak
    Garg, Ravi
    Blair, Alan
    Shen, Chunhua
    van den Hengel, Anton
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6949 - 6959
  • [4] Long-Tail Hashing
    Chen, Yong
    Hou, Yuqing
    Leng, Shu
    Zhang, Qing
    Lin, Zhouchen
    Zhang, Dell
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1328 - 1338
  • [5] Meta-learning Advisor Networks for Long-tail and Noisy Labels in Social Image Classification
    Ricci, Simone
    Uricchio, Tiberio
    Del Bimbo, Alberto
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (05)
  • [6] Long-tail Mixup for Extreme Multi-label Classification
    Han, Sangwoo
    Choi, Eunseong
    Lim, Chan
    Shim, Hyunjung
    Lee, Jongwuk
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3998 - 4002
  • [7] Integrating long-tail data: How far are we?
    Vanderbilt, Kristin
    Gries, Corinna
    [J]. ECOLOGICAL INFORMATICS, 2021, 64
  • [8] Long-tail buffer-content distributions in broadband networks
    Choudhury, GL
    Whitt, W
    [J]. PERFORMANCE EVALUATION, 1997, 30 (03) : 177 - 190
  • [9] Long-tail buffer-content distributions in broadband networks
    AT&T Lab, Holmdel, United States
    [J]. Perform Eval, 3 (177-190):
  • [10] THE TALE OF THE LONG-TAIL PAIR
    LIDGEY, J
    [J]. ELECTRONICS & WIRELESS WORLD, 1985, 91 (1595): : 74 - 76