Unsupervised Cross-Modal Hashing via Semantic Text Mining

被引:12
|
作者
Tu, Rong-Cheng [1 ]
Mao, Xian-Ling [1 ]
Lin, Qinghong [2 ]
Ji, Wenjin [1 ]
Qin, Weize [3 ]
Wei, Wei [4 ]
Huang, Heyan [1 ]
机构
[1] Beijing Inst Technol, Dept Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Shenzhen Univ, Sch Comp Sci & Software Engn, Shenzhen 518052, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
关键词
Cross-modal retrieval; deep supervised hashing; semantic text mining; self-redefined-similarity loss;
D O I
10.1109/TMM.2023.3243608
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing has been widely used in multimedia retrieval tasks due to its fast retrieval speed and low storage cost. Recently, many deep unsupervised cross-modal hashing methods have been proposed to deal the unlabeled datasets. These methods usually construct an instance similarity matrix by fusing the image and text modality-specific similarity matrices as the guiding information to train the hashing networks. However, most of them directly use cosine similarities between the bag-of-words (BoW) vectors of text datapoints to define the text modality-specific similarity matrix, which fails to mine the semantic similarity information contained in the text modal datapoints and leads to the poor quality of the instance similarity matrix. To tackle the aforementioned problem, in this paper, we propose a novel Unsupervised Cross-modal Hashing via Semantic Text Mining, called UCHSTM. Specifically, UCHSTM first mines the correlations between the words of text datapoints. Then, UCHSTM constructs the text modality-specific similarity matrix for the training instances based on the mined correlations between their words. Next, UCHSTM fuses the image and text modality-specific similarity matrices as the final instance similarity matrix to guide the training of hashing model. Furthermore, during the process of training the hashing networks, a novel self-redefined-similarity loss is proposed to further correct some wrong defined similarities in the constructed instance similarity matrix, thereby further enhancing the retrieval performance. Extensive experiments on two widely used datasets show that the proposed UCHSTM outperforms state-of-the-art baselines on cross-modal retrieval tasks.
引用
收藏
页码:8946 / 8957
页数:12
相关论文
共 50 条
  • [1] Semantic-rebased cross-modal hashing for scalable unsupervised text-visual retrieval
    Wang, Weiwei
    Shen, Yuming
    Zhang, Haofeng
    Liu, Li
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [2] Unsupervised Contrastive Cross-Modal Hashing
    Hu, Peng
    Zhu, Hongyuan
    Lin, Jie
    Peng, Dezhong
    Zhao, Yin-Ping
    Peng, Xi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3877 - 3889
  • [3] Completely Unsupervised Cross-Modal Hashing
    Duan, Jiasheng
    Zhang, Pengfei
    Huang, Zi
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 178 - 194
  • [4] Semantic deep cross-modal hashing
    Lin, Qiubin
    Cao, Wenming
    He, Zhihai
    He, Zhiquan
    [J]. NEUROCOMPUTING, 2020, 396 : 113 - 122
  • [5] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Cheng Zhang
    Yuan Wan
    Haopeng Qiang
    [J]. Neural Computing and Applications, 2024, 36 : 5383 - 5397
  • [6] Unsupervised Generative Adversarial Cross-Modal Hashing
    Zhang, Jian
    Peng, Yuxin
    Yuan, Mingkuan
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 539 - 546
  • [7] Set and Rebase: Determining the Semantic Graph Connectivity for Unsupervised Cross-Modal Hashing
    Wang, Weiwei
    Shen, Yuming
    Zhang, Haofeng
    Yao, Yazhou
    Liu, Li
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 853 - 859
  • [8] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Zhang, Cheng
    Wan, Yuan
    Qiang, Haopeng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5383 - 5397
  • [9] Unsupervised Deep Fusion Cross-modal Hashing
    Huang, Jiaming
    Min, Chen
    Jing, Liping
    [J]. ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 358 - 366
  • [10] Deep Semantic-Preserving Reconstruction Hashing for Unsupervised Cross-Modal Retrieval
    Cheng, Shuli
    Wang, Liejun
    Du, Anyu
    [J]. ENTROPY, 2020, 22 (11) : 1 - 22