Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis

Cited by: 21

Authors:

Xiang, Zhuo [1]

Zhuo, Qiuluan [2]

Zhao, Cheng [1]

Deng, Xiaofei [2]

Zhu, Ting [2]

Wang, Tianfu [1]

Jiang, Wei [2]

Lei, Baiying [1]

Affiliations:

[1] Shenzhen Univ, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Sch Biomed Engn,Hlth Sci Ctr, Shenzhen, Peoples R China

[2] Huazhong Univ Sci & Technol, Union Shenzhen Hosp, Dept Ultrasound, Shenzhen, Peoples R China

Source:

COMPUTERS IN BIOLOGY AND MEDICINE | 2022 / Vol. 150

Funding:

National Natural Science Foundation of China;

Keywords:

Thyroid ultrasound; Thyroid diagnosis; Multi-modal; Self-supervision; SHEAR-WAVE ELASTOGRAPHY; CONVOLUTIONAL NEURAL-NETWORK; INTRAOBSERVER REPRODUCIBILITY; NODULES; PERFORMANCE;

DOI:

10.1016/j.compbiomed.2022.106164

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Subject classification codes:

07 ; 0710 ; 09 ;

Abstract:

Ultrasound is a common non-invasive diagnostic method for detecting thyroid cancer lesions. However, because ultrasound images alone provide limited information, shear wave elastography (SWE) and color Doppler ultrasound (CDUS) are also used clinically to assist in diagnosis, which makes diagnosis a time-consuming, labor-intensive, and highly subjective process. Automatic diagnosis of benign and malignant thyroid nodules would therefore benefit clinical practice. To this end, we propose a deep learning-based multi-modal feature fusion network for the automatic diagnosis of thyroid disease from three modalities: gray-scale ultrasound (US), SWE, and CDUS. First, three ResNet18s initialized by self-supervised learning are used as branches to extract the image information of each modality. Then, a multi-modal multi-head attention branch is used to remove the information common to the three modalities, and the knowledge of each modality is combined for thyroid diagnosis. In addition, to better integrate features across modalities, a multi-modal feature guidance module is proposed to guide the feature extraction of each branch and reduce the differences between the features of each modality. We evaluate the method on a self-collected dataset, and the results show that it could provide fast and accurate assistance for sonographers in diagnosing thyroid nodules.
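The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain multi-head self-attention over three modality feature vectors, followed by mean pooling and a two-class linear layer. The feature vectors are random stand-ins for the outputs of the three ResNet18 branches (a real ResNet18 yields 512-dimensional features; 8 dimensions are used here for brevity), all weights are random and untrained, and every function name (`multi_head_attention`, `fuse_and_classify`, and so on) is hypothetical. The feature guidance module and the self-supervised initialization are not modeled.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Untrained weights; a trained model would learn these instead."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def multi_head_attention(tokens, num_heads, rng):
    """Self-attention across modality tokens.

    Returns the fused tokens and, per head, a (tokens x tokens) matrix
    of attention weights.
    """
    n, dim = len(tokens), len(tokens[0])
    assert dim % num_heads == 0, "feature size must divide evenly into heads"
    head_dim = dim // num_heads
    wq, wk, wv = (random_matrix(dim, dim, rng) for _ in range(3))
    q = [matvec(wq, t) for t in tokens]
    k = [matvec(wk, t) for t in tokens]
    v = [matvec(wv, t) for t in tokens]

    fused = [[] for _ in tokens]
    weights = []
    for h in range(num_heads):
        lo, hi = h * head_dim, (h + 1) * head_dim
        head_weights = []
        for i in range(n):
            # Scaled dot-product scores of token i against every token j.
            scores = [
                sum(a * b for a, b in zip(q[i][lo:hi], k[j][lo:hi])) / math.sqrt(head_dim)
                for j in range(n)
            ]
            attn = softmax(scores)
            head_weights.append(attn)
            # Weighted sum of value slices; heads are concatenated per token.
            fused[i].extend(
                sum(attn[j] * v[j][lo + d] for j in range(n)) for d in range(head_dim)
            )
        weights.append(head_weights)
    return fused, weights


def fuse_and_classify(us_feat, swe_feat, cdus_feat, num_heads=2, seed=0):
    """Fuse US, SWE, and CDUS features and return two class probabilities."""
    rng = random.Random(seed)
    tokens = [us_feat, swe_feat, cdus_feat]
    fused, weights = multi_head_attention(tokens, num_heads, rng)
    dim = len(us_feat)
    # Mean-pool the three fused tokens into one vector (an assumption).
    pooled = [sum(tok[d] for tok in fused) / len(fused) for d in range(dim)]
    classifier = random_matrix(2, dim, rng)  # two outputs: benign vs. malignant
    return softmax(matvec(classifier, pooled)), weights


# Random stand-ins for the per-modality features from the three branches.
feature_rng = random.Random(42)
us, swe, cdus = ([feature_rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(3))
probs, weights = fuse_and_classify(us, swe, cdus)
print("class probabilities:", probs)
```

Because the weights are untrained, the printed probabilities carry no diagnostic meaning; the sketch only shows how features from the three modalities can attend to one another before a joint decision.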


Pages: 8

