Deep medical cross-modal attention hashing

被引:6
|
作者
Zhang, Yong [1 ]
Ou, Weihua [1 ,2 ]
Shi, Yufeng [3 ]
Deng, Jiaxin [1 ]
You, Xinge [3 ]
Wang, Anzhi [1 ]
机构
[1] Guizhou Normal Univ, Sch Big Data & Comp Sci, Sch Math & Sci, Guiyang, Peoples R China
[2] Special Key Lab Artificial Intelligence & Intelli, Guiyang, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Comp Sci & Telecommun Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical cross-modal retrieval; Recurrent attention; Hashing code; Discriminative representation learning;
D O I
10.1007/s11280-021-00881-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical cross-modal retrieval aims to retrieve semantically similar medical instances across different modalities, such as retrieving X-ray images using radiology reports or retrieving radiology reports using X-ray images. The main challenge for medical cross-modal retrieval are the semantic gap and the small visual differences between different categories of medical images. To address those issues, we present a novel end-to-end deep hashing method, called Deep Medical Cross-Modal Attention Hashing (DMCAH), which extracts the global features utilizing global average pooling and local features by recurrent attention. Specifically, we recursively move from the coarse to fine-grained regions of images to locate discriminative regions more accurately, and recursively extract the discriminative semantic information of texts from the sentence level to the word level. Then, we select the discriminative features by aggregating the finer feature via adaptive attention. Finally, to reduce the semantic gap, we map images and reports features into a common space and obtain the discriminative hash codes. Comprehensive experimental results on large-scale medical dataset MIMIC-CXR and natural scene dataset MS-COCO show that DMCAH can achieve better performance than existing cross-modal hashing methods.
引用
收藏
页码:1519 / 1536
页数:18
相关论文
共 50 条
  • [21] Weakly Supervised Hashing with Reconstructive Cross-modal Attention
    Du, Yongchao
    Wang, Min
    Lu, Zhenbo
    Zhou, Wengang
    Li, Houqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [22] Self-attention and adversary learning deep hashing network for cross-modal retrieval
    Chen, Shubai
    Wu, Song
    Wang, Li
    Yu, Zhenyang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93
  • [23] Noise-robust Deep Cross-Modal Hashing
    Wang, Runmin
    Yu, Guoxian
    Zhang, Hong
    Guo, Maozu
    Cui, Lizhen
    Zhang, Xiangliang
    [J]. INFORMATION SCIENCES, 2021, 581 : 136 - 154
  • [24] Deep Hashing Similarity Learning for Cross-Modal Retrieval
    Ma, Ying
    Wang, Meng
    Lu, Guangyun
    Sun, Yajun
    [J]. IEEE ACCESS, 2024, 12 : 8609 - 8618
  • [25] Deep Discrete Cross-Modal Hashing with Multiple Supervision
    Yu, En
    Ma, Jianhua
    Sun, Jiande
    Chang, Xiaojun
    Zhang, Huaxiang
    Hauptmann, Alexander G.
    [J]. NEUROCOMPUTING, 2022, 486 : 215 - 224
  • [26] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
    Zhan, Yu-Wei
    Luo, Xin
    Wang, Yongxin
    Xu, Xin-Shun
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
  • [27] FUSION-SUPERVISED DEEP CROSS-MODAL HASHING
    Wang, Li
    Zhu, Lei
    Yu, En
    Sun, Jiande
    Zhang, Huaxiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 37 - 42
  • [28] Ranking-Based Deep Cross-Modal Hashing
    Liu, Xuanwu
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Ren, Yazhou
    Guo, Maozu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4400 - 4407
  • [29] Quadruplet-Based Deep Cross-Modal Hashing
    Liu, Huan
    Xiong, Jiang
    Zhang, Nian
    Liu, Fuming
    Zou, Xitao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [30] Deep Multiscale Fusion Hashing for Cross-Modal Retrieval
    Nie, Xiushan
    Wang, Bowei
    Li, Jiajia
    Hao, Fanchang
    Jian, Muwei
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 401 - 410