Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis

被引:21
|
作者
Xiang, Zhuo [1 ]
Zhuo, Qiuluan [2 ]
Zhao, Cheng [1 ]
Deng, Xiaofei [2 ]
Zhu, Ting [2 ]
Wang, Tianfu [1 ]
Jiang, Wei [2 ]
Lei, Baiying [1 ]
机构
[1] Shenzhen Univ, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Sch Biomed Engn,Hlth Sci Ctr, Shenzhen, Peoples R China
[2] Huazhong Univ Sci & Technol, Union Shenzhen Hosp, Dept Ultrasound, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Thyroid ultrasound; Thyroid diagnosis; Multi-modal; Self-supervision; SHEAR-WAVE ELASTOGRAPHY; CONVOLUTIONAL NEURAL-NETWORK; INTRAOBSERVER REPRODUCIBILITY; NODULES; PERFORMANCE;
D O I
10.1016/j.compbiomed.2022.106164
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ultrasound is a typical non-invasive diagnostic method often used to detect thyroid cancer lesions. However, due to the limitations of the information provided by ultrasound images, shear wave elastography (SWE) and color doppler ultrasound (CDUS) are also used clinically to assist in diagnosis, which makes the diagnosis time-consuming, labor-intensive, and highly subjective process. Therefore, automatic diagnosis of benign and ma-lignant thyroid nodules is beneficial for the clinical diagnosis of the thyroid. To this end, based on three mo-dalities of gray-scale ultrasound images(US), SWE, and CDUS, we propose a deep learning-based multi-modal feature fusion network for the automatic diagnosis of thyroid disease based on the ultrasound images. First, three ResNet18s initialized by self-supervised learning are used as branches to extract the image information of each modality, respectively. Then, a multi-modal multi-head attention branch is used to remove the common infor-mation of three modalities, and the knowledge of each modal is combined for thyroid diagnosis. At the same time, to better integrate the features between modalities, a multi-modal feature guidance module is also pro-posed to guide the feature extraction of each branch and reduce the difference between each-modal feature. We verify the multi-modal thyroid ultrasound image diagnosis method on the self-collected dataset, and the results prove that this method could provide fast and accurate assistance for sonographers in diagnosing thyroid nodules.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Self-Supervised Feature Learning via Exploiting Multi-Modal Data for Retinal Disease Diagnosis
    Li, Xiaomeng
    Jia, Mengyu
    Islam, Md Tauhidul
    Yu, Lequan
    Xing, Lei
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) : 4023 - 4033
  • [22] Self-Supervised Multi-Modal Learning for Collaborative Robotic Grasp-Throw
    Hou, Yanxu
    Fang, Zihan
    Li, Jun
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05): : 4250 - 4256
  • [23] Deep Self-Supervised t-SNE for Multi-modal Subspace Clustering
    Wang, Qianqian
    Xia, Wei
    Tao, Zhiqiang
    Gao, Quanxue
    Cao, Xiaochun
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1748 - 1755
  • [24] Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs
    Tao, Ruijie
    Lee, Kong Aik
    Das, Rohan Kumar
    Hautamaki, Ville
    Li, Haizhou
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1706 - 1719
  • [25] The Effectiveness of Self-supervised Pre-training for Multi-modal Endometriosis Classification
    Butler, David
    Wang, Hu
    Zhang, Yuan
    To, Minh-Son
    Condous, George
    Leonardi, Mathew
    Knox, Steven
    Avery, Jodie
    Hull, M. Louise
    Carneiro, Gustavo
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [26] Deep Convolutional Neural Network for Multi-Modal Image Restoration and Fusion
    Deng, Xin
    Dragotti, Pier Luigi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3333 - 3348
  • [27] Multi-modal network Protocols
    Balan, RK
    Akella, A
    Seshan, S
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2002, 32 (01) : 60 - 60
  • [28] Multi-modal feature fusion for geographic image annotation
    Li, Ke
    Zou, Changqing
    Bu, Shuhui
    Liang, Yun
    Zhang, Jian
    Gong, Minglun
    [J]. PATTERN RECOGNITION, 2018, 73 : 1 - 14
  • [29] A novel multi-modal medical image fusion algorithm
    Li, Xinhua
    Zhao, Jing
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 1995 - 2002
  • [30] Image and Encoded Text Fusion for Multi-Modal Classification
    Gallo, I.
    Calefati, A.
    Nawaz, S.
    Janjua, M. K.
    [J]. 2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 203 - 209