Progressively Hybrid Transformer for Multi-Modal Vehicle Re-Identification

Cited by: 4

Authors:

Pan, Wenjie [1]

Huang, Linhan [1]

Liang, Jianbao [1]

Hong, Lan [1]

Zhu, Jianqing [1,2]

Affiliations:

[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China

[2] Xiamen Yealink Network Technol Co Ltd, 666 Huan Rd, High Tech Pk, Xiamen 361015, Peoples R China

Source:

SENSORS | 2023 / Vol. 23 / Issue 09

Funding:

National Natural Science Foundation of China;

Keywords:

multi-modal image; transformer; vehicle re-identification;

DOI:

10.3390/s23094206

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Subject classification code:

070302 ; 081704 ;

Abstract:

Multi-modal (i.e., visible, near-infrared, and thermal-infrared) vehicle re-identification has good potential for searching for vehicles of interest under low illumination. However, because different modalities have different imaging characteristics, properly fusing their complementary information is crucial to multi-modal vehicle re-identification. To that end, this paper proposes a progressively hybrid transformer (PHT). The PHT method consists of two parts: random hybrid augmentation (RHA) and a feature hybrid mechanism (FHM). For RHA, an image random cropper and a local region hybrider are designed. The image random cropper crops the multi-modal images simultaneously, using a random number of regions with random positions, sizes, and aspect ratios, to generate local regions. The local region hybrider fuses the cropped regions so that the regions of each modality carry local structural characteristics of all modalities, mitigating modality differences at the start of feature learning. For the FHM, a modal-specific controller and a modal information embedding are designed to fuse multi-modal information effectively at the feature level. Experimental results show that the proposed method outperforms the state-of-the-art method by 2.7% mAP on RGBNT100 and by 6.6% mAP on RGBN300, demonstrating that it learns multi-modal complementary information effectively.
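
The random hybrid augmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the local region hybrider fuses regions, so the sketch assumes a simple per-pixel average across modalities. The function name `random_hybrid_augment` and the parameters `max_regions`, `area_range`, and `ratio_range` are hypothetical, and images are simplified to single-channel grids stored as nested lists.

```python
import random


def random_hybrid_augment(images, max_regions=3, area_range=(0.02, 0.2),
                          ratio_range=(0.5, 2.0), rng=None):
    """Fuse randomly cropped regions across aligned modality images.

    images: list of M images (one per modality), each an H x W list of
    lists of floats. All images are assumed to be spatially aligned.
    Returns (augmented_images, regions), where each region is
    (top, left, height, width).
    """
    rng = rng or random.Random()
    height, width = len(images[0]), len(images[0][0])
    # Work on copies so the caller's images are left untouched.
    out = [[row[:] for row in img] for img in images]
    regions = []
    for _ in range(rng.randint(1, max_regions)):            # random number
        area = rng.uniform(*area_range) * height * width    # random size
        ratio = rng.uniform(*ratio_range)                   # width / height
        h = min(height, max(1, round((area / ratio) ** 0.5)))
        w = min(width, max(1, round((area * ratio) ** 0.5)))
        top = rng.randint(0, height - h)                    # random position
        left = rng.randint(0, width - w)
        # The same box is cropped from every modality. Fusion by averaging
        # is an assumption made for this sketch only.
        for y in range(top, top + h):
            for x in range(left, left + w):
                mean = sum(img[y][x] for img in images) / len(images)
                for fused in out:
                    fused[y][x] = mean
        regions.append((top, left, h, w))
    return out, regions
```

After the call, every modality holds the same fused values inside the sampled regions and its own original pixels elsewhere, which is one simple way for each modality to carry local structure from all modalities before feature learning.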


Pages: 16

