Efficient cross-modality feature interaction for multispectral armored vehicle detection

被引：0

作者：

Zhang, Jie ^{[1
]}

Chang, Tian-qing ^{[1
]}

Zhao, Li-yang ^{[2
]}

Ma, Jin-dun ^{[3
]}

Han, Bin ^{[1
]}

Zhang, Lei ^{[1
]}

机构：

[1] Army Acad Armored Forces, Beijing 100072, Peoples R China

[2] PLA Naval Submarine Acad, Qingdao 266199, Peoples R China

[3] PLA, Unit 63966, Beijing 100072, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2024年 / 163卷

关键词：

Cross-modality; Armored vehicle detection; Feature interaction; Multispectral; RECOGNITION;

D O I：

10.1016/j.asoc.2024.111971

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Detecting armed vehicles from a UAV platform is challenging due to the complexity of ground environment. This paper presents a dual-stream multispectral armored vehicle detection method to tackle this problem. First, considering that there is a paucity of datasets containing multispectral armored vehicle images, a multispectral armored vehicle detection dataset is constructed for this study. The dataset consists of 5853 pairs of RGB and infrared images, featuring a total of 15,878 instances of armored vehicles. Then, a cross-modal feature interaction module is designed to enable efficient feature interaction between multispectral images. This module uses the cross-modal channel-wise feature difference method to model the channel differences between the two modal features and obtains the cross-modal channel difference matrix. The cross-modal channel difference matrix is then employed to extract the unique features of the two modal features, allowing for efficient cross-modal feature interaction by complementing each other's unique features. Experiment results demonstrate that the proposed model has excellent detection performance and is capable of coping with various challenges brought by complex ground environments.

引用

页数：13

共 50 条

[1] Attention-based Cross-modality Interaction for Multispectral Pedestrian Detection
Liu, Tianshan
Zhao, Rui
Lam, Kin-Man
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
[2] Attention-Based Cross-Modality Feature Complementation for Multispectral Pedestrian Detection
Jiang, Qunyan
Dai, Juying
Rui, Ting
Shao, Faming
Wang, Jinkang
Lu, Guanlin
IEEE ACCESS, 2022, 10 : 53797 - 53809
[3] Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery
Fang Qingyun
Wang Zhaokui
PATTERN RECOGNITION, 2022, 130
[4] Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery
Qingyun, Fang
Zhaokui, Wang
Pattern Recognition, 2022, 130
[5] Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion
Chen, Shi
Zhang, Lefei
Zhang, Liangpei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 741 - 753
[6] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
Huang, Lian
Peng, Zongju
Chen, Fen
Dai, Shaosheng
He, Ziqiang
Liu, Kesheng
NEURAL NETWORKS, 2024, 173
[7] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
Huang, Lian
Peng, Zongju
Chen, Fen
Dai, Shaosheng
He, Ziqiang
Liu, Kesheng
Neural Networks, 2024, 173
[8] Cross-modality complementary information fusion for multispectral pedestrian detection
Yan, Chaoqi
Zhang, Hong
Li, Xuliang
Yang, Yifan
Yuan, Ding
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10361 - 10386
[9] Cross-modality interactive attention network for multispectral pedestrian detection
Zhang, Lu
Liu, Zhiyong
Zhang, Shifeng
Yang, Xu
Qiao, Hong
Huang, Kaizhu
Hussain, Amir
INFORMATION FUSION, 2019, 50 : 20 - 29
[10] Cross-modality complementary information fusion for multispectral pedestrian detection
Chaoqi Yan
Hong Zhang
Xuliang Li
Yifan Yang
Ding Yuan
Neural Computing and Applications, 2023, 35 : 10361 - 10386

← 1 2 3 4 5 →