Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment

Cited by: 0
Authors
Zhuang, Jiamin [1, 2]
Yu, Jing [1, 2]
Ding, Yang [1, 2]
Qu, Xiangyan [1, 2]
Hu, Yue [1, 2]
Affiliations
[1] Institute of Information Engineering, Chinese Academy of Sciences, China
[2] The School of Cyber Security, University of Chinese Academy of Sciences, China
Source
arXiv | 2023
Keywords
Engineering Village;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Concept levels - Concept-level cross-modal alignment - Context-level cross-modal alignment - Cross-modal - Embeddings - Fast image-text retrieval - Image texts - Self-supervised learning - Text retrieval
Related papers
50 in total
  • [21] Learning Relationship-Enhanced Semantic Graph for Fine-Grained Image-Text Matching
    Liu, Xin
    He, Yi
    Cheung, Yiu-Ming
    Xu, Xing
    Wang, Nannan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) : 948 - 961
  • [22] Fine-grained Image-text Matching by Cross-modal Hard Aligning Network
    Pan, Zhengxin
    Wu, Fangyu
    Zhang, Bailing
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19275 - 19284
  • [23] Patch-Level Consistency Regularization in Self-Supervised Transfer Learning for Fine-Grained Image Recognition
    Lee, Yejin
    Lee, Suho
    Hwang, Sangheum
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [24] RSITR-FFT: Efficient Fine-Grained Fine-Tuning Framework With Consistency Regularization for Remote Sensing Image-Text Retrieval
    Xiu, Di
    Ji, Luyan
    Geng, Xiurui
    Wu, Yirong
    [J]. IEEE Geoscience and Remote Sensing Letters, 2024, 21
  • [25] Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs
    Wang, Yongpeng
    Jiang, Chunmao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [26] Lifelong Fine-Grained Image Retrieval
    Chen, Wei
    Xu, Haoyang
    Pu, Nan
    Liu, Yu
    Lao, Mingrui
    Wang, Weiping
    Liu, Li
    Lew, Michael S.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7533 - 7544
  • [27] Fine-Grained Bidirectional Attention-Based Generative Networks for Image-Text Matching
    Li, Zhixin
    Zhu, Jianwei
    Wei, Jiahui
    Zeng, Yufei
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 390 - 406
  • [28] A robust self-supervised approach for fine-grained crack detection in concrete structures
    Sohaib, Muhammad
    Hasan, Md Junayed
    Shah, Mohd Asif
    Zheng, Zhonglong
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [29] FUMMER: A fine-grained self-supervised momentum distillation framework for multimodal recommendation
    Wei, Yibiao
    Xu, Yang
    Zhu, Lei
    Ma, Jingwei
    Huang, Jiangping
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
  • [30] DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image-Text Alignment for Automatic Radiology Report Generation
    Yin, Heng
    Wu, Wei
    Hao, Yongtao
    [J]. ELECTRONICS, 2024, 13 (16)