Progressively Hybrid Transformer for Multi-Modal Vehicle Re-Identification

被引:4
|
作者
Pan, Wenjie [1 ]
Huang, Linhan [1 ]
Liang, Jianbao [1 ]
Hong, Lan [1 ]
Zhu, Jianqing [1 ,2 ]
机构
[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[2] Xiamen Yealink Network Technol Co Ltd, 666 Huan Rd, High Tech Pk, Xiamen 361015, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal image; transformer; vehicle re-identification;
D O I
10.3390/s23094206
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multi-modal (i.e., visible, near-infrared, and thermal-infrared) vehicle re-identification has good potential to search vehicles of interest in low illumination. However, due to the fact that different modalities have varying imaging characteristics, a proper multi-modal complementary information fusion is crucial to multi-modal vehicle re-identification. For that, this paper proposes a progressively hybrid transformer (PHT). The PHT method consists of two aspects: random hybrid augmentation (RHA) and a feature hybrid mechanism (FHM). Regarding RHA, an image random cropper and a local region hybrider are designed. The image random cropper simultaneously crops multi-modal images of random positions, random numbers, random sizes, and random aspect ratios to generate local regions. The local region hybrider fuses the cropped regions to let regions of each modal bring local structural characteristics of all modalities, mitigating modal differences at the beginning of feature learning. Regarding the FHM, a modal-specific controller and a modal information embedding are designed to effectively fuse multi-modal information at the feature level. Experimental results show the proposed method wins the state-of-the-art method by a larger 2.7% mAP on RGBNT100 and a larger 6.6% mAP on RGBN300, demonstrating that the proposed method can learn multi-modal complementary information effectively.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] VEHICLE RE-IDENTIFICATION BY MULTI-GRAIN LEARNING
    Yang, Xiaoliang
    Lang, Congyan
    Peng, Peixi
    Xing, Junliang
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3113 - 3117
  • [22] Multi-scale attention vehicle re-identification
    Aihua Zheng
    Xianmin Lin
    Jiacheng Dong
    Wenzhong Wang
    Jin Tang
    Bin Luo
    Neural Computing and Applications, 2020, 32 : 17489 - 17503
  • [23] Multi-Spectral Vehicle Re-Identification: A Challenge
    Li, Hongchao
    Li, Chenglong
    Zhu, Xianpeng
    Zheng, Aihua
    Luo, Bin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11345 - 11353
  • [24] MULTI-VIEW LEARNING FOR VEHICLE RE-IDENTIFICATION
    Lin, Weipeng
    Li, Yidong
    Yang, Xiaoliang
    Peng, Peixi
    Xing, Junliang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 832 - 837
  • [25] Multi-scale attention vehicle re-identification
    Zheng, Aihua
    Lin, Xianmin
    Dong, Jiacheng
    Wang, Wenzhong
    Tang, Jin
    Luo, Bin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (23): : 17489 - 17503
  • [26] Joint graph regularized dictionary learning and sparse ranking for multi-modal multi-shot person re-identification
    Zheng, Aihua
    Li, Hongchao
    Jiang, Bo
    Zheng, Wei-Shi
    Luo, Bin
    PATTERN RECOGNITION, 2020, 104 (104)
  • [27] Vehicle Re-identification method based on Swin-Transformer network
    Li J.
    Yu C.
    Shi J.
    Zhang C.
    Ke T.
    Array, 2022, 16
  • [28] MART: Mask-Aware Reasoning Transformer for Vehicle Re-Identification
    Lu, Zefeng
    Lin, Ronghao
    Hu, Haifeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1994 - 2009
  • [29] Hierarchical Multi-Modal Prompting Transformer for Multi-Modal Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6376 - 6390
  • [30] MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid
    Chen, Zhuo
    Chen, Jiaoyan
    Zhang, Wen
    Guo, Lingbing
    Fang, Yin
    Huang, Yufeng
    Zhang, Yichi
    Geng, Yuxia
    Pan, Jeff Z.
    Song, Wenting
    Chen, Huajun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3317 - 3327