Progressively Hybrid Transformer for Multi-Modal Vehicle Re-Identification

Cited by: 4

Authors:

Pan, Wenjie [1]

Huang, Linhan [1]

Liang, Jianbao [1]

Hong, Lan [1]

Zhu, Jianqing [1,2]

Affiliations:

[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China

[2] Xiamen Yealink Network Technol Co Ltd, 666 Huan Rd, High Tech Pk, Xiamen 361015, Peoples R China

Source:

SENSORS | 2023 / Volume 23 / Issue 09

Funding:

National Natural Science Foundation of China;

Keywords:

multi-modal image; transformer; vehicle re-identification;

DOI:

10.3390/s23094206

Chinese Library Classification number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

Multi-modal (i.e., visible, near-infrared, and thermal-infrared) vehicle re-identification has good potential for searching for vehicles of interest under low illumination. However, because different modalities have different imaging characteristics, properly fusing multi-modal complementary information is crucial to multi-modal vehicle re-identification. To that end, this paper proposes a progressively hybrid transformer (PHT). The PHT method consists of two parts: random hybrid augmentation (RHA) and a feature hybrid mechanism (FHM). For RHA, an image random cropper and a local region hybrider are designed. The image random cropper simultaneously crops the multi-modal images at random positions, in random numbers, and with random sizes and random aspect ratios to generate local regions. The local region hybrider fuses the cropped regions so that the regions of each modality carry local structural characteristics of all modalities, mitigating modal differences at the beginning of feature learning. For the FHM, a modal-specific controller and a modal information embedding are designed to effectively fuse multi-modal information at the feature level. Experimental results show that the proposed method outperforms the state-of-the-art method by 2.7% mAP on RGBNT100 and by 6.6% mAP on RGBN300, demonstrating that it can learn multi-modal complementary information effectively.
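The abstract describes RHA only at a high level, so the following is a minimal NumPy sketch of the idea rather than the paper's implementation. It assumes the modality images are spatially aligned arrays of identical shape, and it assumes the cropped regions are fused by simple averaging. The abstract does not say how the local region hybrider combines regions. The function name `random_hybrid_augment` and the parameters `max_regions`, `scale`, and `ratio` are hypothetical.

```python
import numpy as np


def random_hybrid_augment(images, rng, max_regions=3,
                          scale=(0.05, 0.2), ratio=(0.5, 2.0)):
    """Illustrative random hybrid augmentation (not the paper's code).

    images: list of HxWxC float arrays, one per modality, all the same shape
            and assumed to be spatially aligned.
    rng:    a numpy.random.Generator.
    Returns new arrays; the inputs are left unmodified.
    """
    h, w = images[0].shape[:2]
    out = [img.copy() for img in images]

    # Random number of local regions.
    n_regions = int(rng.integers(1, max_regions + 1))
    for _ in range(n_regions):
        # Random size (as a fraction of the image area) and aspect ratio.
        area = rng.uniform(*scale) * h * w
        aspect = rng.uniform(*ratio)
        rh = int(min(h, max(1, round(np.sqrt(area * aspect)))))
        rw = int(min(w, max(1, round(np.sqrt(area / aspect)))))

        # Random position, shared by all modalities ("simultaneous" crop).
        top = int(rng.integers(0, h - rh + 1))
        left = int(rng.integers(0, w - rw + 1))

        # Crop the same window from every original modality image.
        patches = [img[top:top + rh, left:left + rw] for img in images]

        # ASSUMPTION: fuse by per-pixel averaging across modalities.
        fused = np.mean(patches, axis=0)

        # Write the fused region back into every modality.
        for o in out:
            o[top:top + rh, left:left + rw] = fused
    return out
```

Calling it with three aligned images, for example `random_hybrid_augment([rgb, nir, tir], np.random.default_rng(0))`, returns three images in which the same randomly chosen windows carry information from all modalities. The feature-level FHM is not sketched here because the abstract gives no detail about the modal-specific controller or the modal information embedding.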


Pages: 16
