MCICT: Graph convolutional network-based end-to-end model for multi-label classification of imbalanced clinical text

被引:0
|
作者
He, Yao [1 ,2 ]
Xiong, Qingyu [1 ,2 ]
Ke, Cai [2 ]
Wang, Yaqiang [3 ]
Yang, Zhengyi [2 ]
Yi, Hualing [2 ]
Fan, Qilin [2 ]
机构
[1] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 401331, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[3] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu 610225, Peoples R China
关键词
Multi-label text classification; Clinical text classification; Graph convolutional neural network; Label co-occurrence; Additional information;
D O I
10.1016/j.bspc.2023.105873
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The rapid growth of clinical text data requires accurate and powerful automated classification methods to support medical decision making and personalized healthcare. The multi-label classification task for clinical texts is designed to assign the most relevant set of labels to each clinical text. However, this task presents two significant challenges: (1) how to accurately extract fine-grained semantic features from complex clinical texts, and (2) how to effectively mitigate the issue of label class imbalance. To overcome these problems, we innovatively propose a novel Multi-label Classification of Imbalanced Clinical Text (MCICIT) model. In order to obtain fine-grained semantic features from clinical texts, we utilize the specialized pre-trained language model BioBERT, tailored for biomedical texts. To tackle the challenge of label class imbalance, we present a Co-occurrence Based and Embeddings with Additional Information Enhanced Graph Convolutional Network (CoEAI-GCN) module. On one hand, we enrich the label content by incorporating additional information to acquire more accurate word embeddings as the feature matrix. On the other hand, we combine the cooccurrence relationship of labels to construct a correlation matrix. Ultimately, label representations are learned through a graph convolutional network. By conducting multi-label classification experiments on two clinical text datasets extracted from real medical systems, our model achieves a 3.2% and 0.5% improvement in F1 scores, respectively, compared to state -of -the -art deep learning models. Additionally, we conduct ablation studies to explore the behaviors of the proposed model. These results together demonstrate that our proposed MCICT effectively enhances the classification performance of imbalanced clinical texts.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Label Correlation Based Graph Convolutional Network for Multi-label Text Classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Manh-Tran Tien
    Van-Hau Nguyen
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [2] Label-representative graph convolutional network for multi-label text classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Minh-Hieu Pham
    Van-Quyet Nguyen
    Van-Hau Nguyen
    [J]. Applied Intelligence, 2023, 53 : 14759 - 14774
  • [3] Label-representative graph convolutional network for multi-label text classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Minh-Hieu Pham
    Van-Quyet Nguyen
    Van-Hau Nguyen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14759 - 14774
  • [4] Multi-label text classification based on semantic-sensitive graph convolutional network
    Zeng, Delong
    Zha, Enze
    Kuang, Jiayi
    Shen, Ying
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [5] Dual-view graph convolutional network for multi-label text classification
    Li, Xiaohong
    You, Ben
    Peng, Qixuan
    Feng, Shaojie
    [J]. APPLIED INTELLIGENCE, 2024, 54 (19) : 9363 - 9380
  • [6] An End-to-End Multi-label classification model for Arrhythmia based on varied-length ECG signals
    Dong, Yanfang
    Cai, Wenqiang
    Zhu, Wenliang
    Wang, Lirong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1149 - 1154
  • [7] Vertebrae Labeling via End-to-End Integral Regression Localization and Multi-Label Classification Network
    Qin, Chunli
    Zhou, Ji
    Yao, Demin
    Zhuang, Han
    Wang, Hui
    Chen, Shiyao
    Shi, Yonghong
    Song, Zhijian
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (06) : 2726 - 2736
  • [8] End-to-End Probabilistic Label-Specific Feature Learning for Multi-Label Classification
    Hang, Jun-Yi
    Zhang, Min-Ling
    Feng, Yanghe
    Song, Xiaocheng
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6847 - 6855
  • [9] Multi-label classification of fundus images based on graph convolutional network
    Cheng, Yinlin
    Ma, Mengnan
    Li, Xingyu
    Zhou, Yi
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (SUPPL 2)
  • [10] Multi-label classification of fundus images based on graph convolutional network
    Yinlin Cheng
    Mengnan Ma
    Xingyu Li
    Yi Zhou
    [J]. BMC Medical Informatics and Decision Making, 21