Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification

被引:0
|
作者
Du, Guodong [1 ]
Gong, Tiantian [1 ]
Zhang, Liyan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Image-text retrieval; Cross-modal alignment; Modality imbalance; Robustness;
D O I
10.1016/j.eswa.2024.125429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-image person re-identification (TIReID) has emerged as a versatile approach for retrieving target pedestrians using textual descriptions. However, current TIReID research has been overly idealistic and has overlooked the issues of data incompleteness and modal imbalance in real-world application scenarios. Therefore, in this paper, we propose imbalanced text-image person re-identification (ITIReID) to address these problems. In comparison to TIReID, ITIReID contains a larger proportion of unimodal data, which leads to modal imbalance. The setting of ITIReID is more aligned with real-world scenarios, and studying ITIReID can expand the application scalability of TIReID. We propose a Graph-based Consistent Reconstruction and Alignment framework (GCRA), for ITIReID, which achieves modal balance by completing missing modality features for training implementation. By treating the accessible modality features as graph nodes, GCRA firstly builds an adjacency graph where a new semantic distance that establishes semantic relevance between nodes by comprehensively measuring both intra-modality and inter-modality correlation, serves as the measurement of graph's edges. GCRA further reconstructs the missing nodes - thus re-establishing missing modality features - using existing nodes connected with high semantic relevance. To ensure the reliability and effectiveness of reconstructed features, we propose a proxy-based identity constraint and a reconstruction constraint. In addition, to enable effective semantic alignment using both the reconstructed features and original features, we introduce a cross-modal semantic constraint. Extensive experiments demonstrate that GCRA can effectively handle issues of data incompleteness and modal imbalance, exhibiting its effectiveness and superiority.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Person re-identification with part prediction alignment
    Li, Zhiyong
    Lv, Jingyi
    Chen, Ying
    Yuan, Jin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 205
  • [22] Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification
    Zhao, Zhiwei
    Liu, Bin
    Lu, Yan
    Chu, Qi
    Yu, Nenghai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7534 - 7542
  • [23] Weakly Supervised Text-based Person Re-Identification
    Zhao, Shizhen
    Gao, Changxin
    Shao, Yuanjie
    Zheng, Wei-Shi
    Sang, Nong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11375 - 11384
  • [24] Text Based Unsupervised Domain Generalization Person Re-identification
    Zhang, Guoqing
    Jin, Tong
    Liu, Tianqi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XV, 2025, 15045 : 377 - 391
  • [25] Salient feature based graph matching for person re-identification
    Iodice, Sara
    Petrosino, Alfredo
    PATTERN RECOGNITION, 2015, 48 (04) : 1074 - 1085
  • [26] PERSON RE-IDENTIFICATION BASED ON HIERARCHICAL BIPARTITE GRAPH MATCHING
    Huang, Yan
    Sheng, Hao
    Xiong, Zhang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4255 - 4259
  • [27] Cross-modality neighbor constraints based unbalanced multi-view text-image re-identification
    Li, Yongxi
    Tang, Wenzhong
    Zhang, Ke
    Zhu, Xi
    Wang, Haoming
    Wang, Shuai
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [28] Unsupervised Graph Association for Person Re-identification
    Wu, Jinlin
    Yang, Yang
    Liu, Hao
    Liao, Shengcai
    Lei, Zhen
    Li, Stan Z.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8320 - 8329
  • [29] Graph Correspondence Transfer for Person Re-Identification
    Zhou, Qin
    Fan, Heng
    Zheng, Shibao
    Su, Hang
    Li, Xinzhe
    Wu, Shuang
    Ling, Haibin
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7599 - 7606
  • [30] Research on person re-identification based on posture guidance and feature alignment
    Jin Che
    Yuxia Zhang
    Qi Yang
    Yuting He
    Multimedia Systems, 2023, 29 : 763 - 770