Deep medical cross-modal attention hashing

Cited by: 6

Authors:

Zhang, Yong ^{[1]}

Ou, Weihua ^{[1,2]}

Shi, Yufeng ^{[3]}

Deng, Jiaxin ^{[1]}

You, Xinge ^{[3]}

Wang, Anzhi ^{[1]}

Affiliations:

[1] Guizhou Normal Univ, Sch Big Data & Comp Sci, Sch Math & Sci, Guiyang, Peoples R China

[2] Special Key Lab Artificial Intelligence & Intelli, Guiyang, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Comp Sci & Telecommun Engn, Wuhan, Peoples R China

Source:

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2022 / Volume 25 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Medical cross-modal retrieval; Recurrent attention; Hashing code; Discriminative representation learning;

DOI:

10.1007/s11280-021-00881-8

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Medical cross-modal retrieval aims to retrieve semantically similar medical instances across different modalities, such as retrieving X-ray images using radiology reports or retrieving radiology reports using X-ray images. The main challenges for medical cross-modal retrieval are the semantic gap and the small visual differences between different categories of medical images. To address these issues, we present a novel end-to-end deep hashing method, called Deep Medical Cross-Modal Attention Hashing (DMCAH), which extracts global features using global average pooling and local features using recurrent attention. Specifically, we recursively move from coarse to fine-grained regions of images to locate discriminative regions more accurately, and recursively extract the discriminative semantic information of texts from the sentence level to the word level. Then, we select the discriminative features by aggregating the finer features via adaptive attention. Finally, to reduce the semantic gap, we map image and report features into a common space and obtain the discriminative hash codes. Comprehensive experimental results on the large-scale medical dataset MIMIC-CXR and the natural scene dataset MS-COCO show that DMCAH can achieve better performance than existing cross-modal hashing methods.
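The retrieval idea in the abstract (pool features, map them into a common space, binarize, compare codes) can be illustrated with a minimal sketch. This is not the authors' implementation: the recurrent and adaptive attention steps are omitted, and the function names, the fixed projection weights, and the sign-based binarization with Hamming-distance ranking are all assumptions chosen for illustration.

```python
from typing import List, Sequence


def global_average_pool(feature_map: Sequence[Sequence[Sequence[float]]]) -> List[float]:
    """Average each channel's H x W grid into a single value (one per channel)."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def project(features: Sequence[float], weights: Sequence[Sequence[float]]) -> List[float]:
    """Linear map into a common space; `weights` has one row per hash bit (hypothetical values)."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def to_hash_code(values: Sequence[float]) -> List[int]:
    """Binarize by sign into a {-1, +1} code (an assumed, commonly used convention)."""
    return [1 if v >= 0 else -1 for v in values]


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Count the positions where two codes differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def rank_by_hamming(query: Sequence[int], database: Sequence[Sequence[int]]) -> List[int]:
    """Return database indices ordered from nearest to farthest code."""
    return sorted(range(len(database)), key=lambda i: hamming_distance(query, database[i]))


# Toy "image" feature map with 2 channels of size 2 x 2 (made-up numbers).
feature_map = [[[1.0, 3.0], [5.0, 7.0]], [[-2.0, -2.0], [-2.0, -2.0]]]
# Hypothetical projection producing a 4-bit code.
weights = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]

pooled = global_average_pool(feature_map)
query_code = to_hash_code(project(pooled, weights))

# Toy "report" codes, assumed to have been produced by a text branch.
report_codes = [[1, 1, 1, 1], [1, -1, 1, -1], [-1, 1, -1, 1]]
ranking = rank_by_hamming(query_code, report_codes)
```

In a trained model the projection would be learned jointly for both modalities so that matching image and report pairs receive nearby codes; here the weights are fixed only to keep the sketch self-contained.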

Citation

Pages: 1519-1536

Number of pages: 18

50 items in total

[21] Weakly Supervised Hashing with Reconstructive Cross-modal Attention
Du, Yongchao
Wang, Min
Lu, Zhenbo
Zhou, Wengang
Li, Houqiang
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[22] Self-attention and adversary learning deep hashing network for cross-modal retrieval
Chen, Shubai
Wu, Song
Wang, Li
Yu, Zhenyang
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93
[23] Noise-robust Deep Cross-Modal Hashing
Wang, Runmin
Yu, Guoxian
Zhang, Hong
Guo, Maozu
Cui, Lizhen
Zhang, Xiangliang
[J]. INFORMATION SCIENCES, 2021, 581 : 136 - 154
[24] Deep Hashing Similarity Learning for Cross-Modal Retrieval
Ma, Ying
Wang, Meng
Lu, Guangyun
Sun, Yajun
[J]. IEEE ACCESS, 2024, 12 : 8609 - 8618
[25] Deep Discrete Cross-Modal Hashing with Multiple Supervision
Yu, En
Ma, Jianhua
Sun, Jiande
Chang, Xiaojun
Zhang, Huaxiang
Hauptmann, Alexander G.
[J]. NEUROCOMPUTING, 2022, 486 : 215 - 224
[26] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
Zhan, Yu-Wei
Luo, Xin
Wang, Yongxin
Xu, Xin-Shun
[J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
[27] FUSION-SUPERVISED DEEP CROSS-MODAL HASHING
Wang, Li
Zhu, Lei
Yu, En
Sun, Jiande
Zhang, Huaxiang
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 37 - 42
[28] Ranking-Based Deep Cross-Modal Hashing
Liu, Xuanwu
Yu, Guoxian
Domeniconi, Carlotta
Wang, Jun
Ren, Yazhou
Guo, Maozu
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4400 - 4407
[29] Quadruplet-Based Deep Cross-Modal Hashing
Liu, Huan
Xiong, Jiang
Zhang, Nian
Liu, Fuming
Zou, Xitao
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
[30] Deep Multiscale Fusion Hashing for Cross-Modal Retrieval
Nie, Xiushan
Wang, Bowei
Li, Jiajia
Hao, Fanchang
Jian, Muwei
Yin, Yilong
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 401 - 410
