Graph-Based Progressive Fusion Network for Multi-Modality Vehicle Re-Identification

Cited by: 13
Authors
He, Qiaolin [1]
Lu, Zefeng [1]
Wang, Zihan [1]
Hu, Haifeng [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Vehicle re-identification; multi-modality; graph convolutional networks; data enhancement; PERSON REIDENTIFICATION;
DOI
10.1109/TITS.2023.3285758
Chinese Library Classification
TU [Building Science];
Discipline classification code
0813 ;
Abstract
Vehicle re-identification (Re-ID) is a critical task in intelligent transportation, aiming to match vehicle images of the same identity captured by non-overlapping cameras. However, it is difficult to achieve satisfactory results in darkness based on RGB images alone, so it is important to consider multi-modality vehicle re-identification. Existing works handle features of different modalities through direct summation and heat-map-based fusion, which ignores the relationships between them. Meanwhile, there is a large gap between the modalities, which needs to be reduced. To solve these two problems, we propose a Graph-based Progressive Fusion Network (GPFNet), which uses a graph convolutional network (GCN) to adaptively fuse multi-modality features in an end-to-end learning framework. GPFNet consists of a CNN feature extraction module (FEM), a GCN feature fusion module (FFM), and a loss function module (LFM). First, in FEM, we employ a multi-stream network architecture to extract single-modality and common-modality features, and a random modality substitution module to extract mixed-modality features. Second, in FFM, we design an efficient graph structure to associate the features of different modalities and adopt a progressive two-stage strategy to fuse them. Finally, in LFM, we use a GCN-aware multi-modality loss to constrain the features. To reduce modality differences and provide better initial mixed-modality features to FFM, we propose random modality substitution as a data enhancement method for multi-modality datasets. Extensive experiments on the multi-modality vehicle Re-ID datasets RGBN300 and RGBNT100 show that our model achieves state-of-the-art performance.
Pages: 12431 - 12447
Number of pages: 17
Related papers
50 in total
  • [31] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Yang Wang
    Jinjia Peng
    Huibing Wang
    Meng Wang
    Science China Information Sciences, 2022, 65
  • [32] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Wang, Yang
    Peng, Jinjia
    Wang, Huibing
    Wang, Meng
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (06)
  • [33] Cross-Modality Person Re-identification Based on Locally Heterogeneous Polymerization Graph Convolutional Network
    Sun R.
    Zhang L.
    Yu Y.-H.
    Zhang X.-D.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (04) : 810 - 825
  • [34] M3L: Multi-modality mining for metric learning in person re-Identification
    Liu, Xiaokai
    Ma, Xiaorui
    Wang, Jie
    Wang, Hongyu
    PATTERN RECOGNITION, 2018, 76 : 650 - 661
  • [35] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Yang WANG
    Jinjia PENG
    Huibing WANG
    Meng WANG
    Science China(Information Sciences), 2022, 65 (06) : 33 - 47
  • [36] MULTI-VIEW VEHICLE IMAGE GENERATION NETWORK FOR VEHICLE RE-IDENTIFICATION
    Xun, Yizhe
    Liu, Jia
    Islam, Sardar M. N.
    Chen, Yuanfang
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 517 - 522
  • [37] GCN-Based Multi-Modality Fusion Network for Action Recognition
    Liu, Shaocan
    Wang, Xingtao
    Xiong, Ruiqin
    Fan, Xiaopeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1242 - 1253
  • [38] Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection
    Roy, Debashri
    Li, Yuanyuan
    Jian, Tong
    Tian, Peng
    Chowdhury, Kaushik
    Ioannidis, Stratis
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2280 - 2295
  • [39] Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification
    Zhang, Yaobin
    Lv, Jianming
    Liu, Chen
    Cai, Hongmin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3736 - 3744
  • [40] Cross-view vehicle re-identification based on graph matching
    Zhang, Chao
    Yang, Chule
    Wu, Dayan
    Dong, Hongbin
    Deng, Baosong
    APPLIED INTELLIGENCE, 2022, 52 (13) : 14799 - 14810