Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification

被引:0
|
作者
Du, Guodong [1 ]
Gong, Tiantian [1 ]
Zhang, Liyan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Image-text retrieval; Cross-modal alignment; Modality imbalance; Robustness;
D O I
10.1016/j.eswa.2024.125429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-image person re-identification (TIReID) has emerged as a versatile approach for retrieving target pedestrians using textual descriptions. However, current TIReID research has been overly idealistic and has overlooked the issues of data incompleteness and modal imbalance in real-world application scenarios. Therefore, in this paper, we propose imbalanced text-image person re-identification (ITIReID) to address these problems. In comparison to TIReID, ITIReID contains a larger proportion of unimodal data, which leads to modal imbalance. The setting of ITIReID is more aligned with real-world scenarios, and studying ITIReID can expand the application scalability of TIReID. We propose a Graph-based Consistent Reconstruction and Alignment framework (GCRA), for ITIReID, which achieves modal balance by completing missing modality features for training implementation. By treating the accessible modality features as graph nodes, GCRA firstly builds an adjacency graph where a new semantic distance that establishes semantic relevance between nodes by comprehensively measuring both intra-modality and inter-modality correlation, serves as the measurement of graph's edges. GCRA further reconstructs the missing nodes - thus re-establishing missing modality features - using existing nodes connected with high semantic relevance. To ensure the reliability and effectiveness of reconstructed features, we propose a proxy-based identity constraint and a reconstruction constraint. In addition, to enable effective semantic alignment using both the reconstructed features and original features, we introduce a cross-modal semantic constraint. Extensive experiments demonstrate that GCRA can effectively handle issues of data incompleteness and modal imbalance, exhibiting its effectiveness and superiority.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Unsupervised Person Re-identification via Graph-Structured Image Matching
    Xu, Bolei
    Qiu, Guoping
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III, 2017, 10118 : 301 - 314
  • [32] CLUSTER-BASED DISTRIBUTION ALIGNMENT FOR GENERALIZABLE PERSON RE-IDENTIFICATION
    Zhu, Chengzhang
    Chang, Zhe
    Xiao, Yalong
    Zou, Beiji
    Li, Bozhou
    Liu, Shu
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [33] Research on person re-identification based on posture guidance and feature alignment
    Che, Jin
    Zhang, Yuxia
    Yang, Qi
    He, Yuting
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 763 - 770
  • [34] A Novel Collaborative Consistent Learning for Person Re-Identification
    Wang, Xiaoman
    Li, Ruidong
    Wang, Li
    Gao, Kai
    Cao, Fang
    Cui, Qianjin
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 101 - 105
  • [35] SCANet: Person Re-Identification with Semantically Consistent Attention
    Li, Ce
    Jin, Shangang
    Chang, Enbing
    Xuan, Shuxing
    Liu, Fenghua
    Xu, Dayou
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3424 - 3428
  • [36] Text-image Alignment for Diffusion-based Perception
    Kondapanenil, Neehar
    Marksl, Markus
    Knott, Manuel
    Guimaraes, Rogerio
    Perona, Pietro
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13883 - 13893
  • [37] GRAPH CONVOLUTION FOR RE-RANKING IN PERSON RE-IDENTIFICATION
    Zhang, Yuqi
    Qian, Qi
    Liu, Chong
    Chen, Weihua
    Wang, Fan
    Li, Hao
    Jin, Rong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2704 - 2708
  • [38] Meta Distribution Alignment for Generalizable Person Re-Identification
    Ni, Hao
    Song, Jingkuan
    Luo, Xiaopeng
    Zheng, Feng
    Li, Wen
    Shen, Heng Tao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2477 - 2486
  • [39] Person Re-Identification Method Based on Image Style Transfer
    Wang C.-K.
    Chen Y.-L.
    Cai X.-D.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (03): : 67 - 72
  • [40] Parallel Data Augmentation for Text-based Person Re-identification
    Cai, Han-Qing
    Li, Xin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,