Progressively Hybrid Transformer for Multi-Modal Vehicle Re-Identification

被引:4
|
作者
Pan, Wenjie [1 ]
Huang, Linhan [1 ]
Liang, Jianbao [1 ]
Hong, Lan [1 ]
Zhu, Jianqing [1 ,2 ]
机构
[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[2] Xiamen Yealink Network Technol Co Ltd, 666 Huan Rd, High Tech Pk, Xiamen 361015, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal image; transformer; vehicle re-identification;
D O I
10.3390/s23094206
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multi-modal (i.e., visible, near-infrared, and thermal-infrared) vehicle re-identification has good potential to search vehicles of interest in low illumination. However, due to the fact that different modalities have varying imaging characteristics, a proper multi-modal complementary information fusion is crucial to multi-modal vehicle re-identification. For that, this paper proposes a progressively hybrid transformer (PHT). The PHT method consists of two aspects: random hybrid augmentation (RHA) and a feature hybrid mechanism (FHM). Regarding RHA, an image random cropper and a local region hybrider are designed. The image random cropper simultaneously crops multi-modal images of random positions, random numbers, random sizes, and random aspect ratios to generate local regions. The local region hybrider fuses the cropped regions to let regions of each modal bring local structural characteristics of all modalities, mitigating modal differences at the beginning of feature learning. Regarding the FHM, a modal-specific controller and a modal information embedding are designed to effectively fuse multi-modal information at the feature level. Experimental results show the proposed method wins the state-of-the-art method by a larger 2.7% mAP on RGBNT100 and a larger 6.6% mAP on RGBN300, demonstrating that the proposed method can learn multi-modal complementary information effectively.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Multi-Proxy Constraint Loss for Vehicle Re-Identification
    Chen, Xu
    Sui, Haigang
    Fang, Jian
    Zhou, Mingting
    Wu, Chen
    SENSORS, 2020, 20 (18) : 1 - 15
  • [32] Multi-modal long document classification based on Hierarchical Prompt and Multi-modal Transformer
    Liu T.
    Hu Y.
    Gao J.
    Wang J.
    Sun Y.
    Yin B.
    Neural Networks, 2024, 176
  • [33] Multi-granularity Cross Transformer Network for person re-identification
    Li, Yanping
    Miao, Duoqian
    Zhang, Hongyun
    Zhou, Jie
    Zhao, Cairong
    PATTERN RECOGNITION, 2024, 150
  • [34] Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification
    Li, Ming
    Liu, Jun
    Zheng, Ce
    Huang, Xinming
    Zhang, Ziming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 919 - 929
  • [35] A hybrid fusion framework to multi-modal bio metric identification
    Mohammed Chachan Younis
    Huthaifa Abuhammad
    Multimedia Tools and Applications, 2021, 80 : 25799 - 25822
  • [36] A hybrid fusion framework to multi-modal bio metric identification
    Younis, Mohammed Chachan
    Abuhammad, Huthaifa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25799 - 25822
  • [37] Hybrid parameter identification of a multi-modal underwater soft robot
    Giorgio-Serchi, F.
    Arienti, A.
    Corucci, F.
    Giorelli, M.
    Laschi, C.
    BIOINSPIRATION & BIOMIMETICS, 2017, 12 (02) : 1 - 15
  • [38] Vehicle Classification and Identification Using Multi-Modal Sensing and Signal Learning
    Kerekes, Ryan A.
    Karnowski, Thomas P.
    Kuhn, Mike
    Moore, Michael R.
    Stinson, Brad
    Tokola, Ryan
    Anderson, Adam
    Vann, Jason M.
    2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
  • [39] A Multi-Modal Transformer network for action detection
    Korban, Matthew
    Youngs, Peter
    Acton, Scott T.
    PATTERN RECOGNITION, 2023, 142
  • [40] Multi-Modal Adversarial Example Detection with Transformer
    Ding, Chaoyue
    Sun, Shiliang
    Zhao, Jing
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,