Efficient cross-modality feature interaction for multispectral armored vehicle detection

Cited by: 0

Authors:

Zhang, Jie [1]

Chang, Tian-qing [1]

Zhao, Li-yang [2]

Ma, Jin-dun [3]

Han, Bin [1]

Zhang, Lei [1]

Affiliations:

[1] Army Acad Armored Forces, Beijing 100072, Peoples R China

[2] PLA Naval Submarine Acad, Qingdao 266199, Peoples R China

[3] PLA, Unit 63966, Beijing 100072, Peoples R China

Source:

Applied Soft Computing | 2024 / Volume 163

Keywords:

Cross-modality; Armored vehicle detection; Feature interaction; Multispectral; RECOGNITION;

DOI:

10.1016/j.asoc.2024.111971

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Detecting armored vehicles from a UAV platform is challenging because of the complexity of the ground environment. This paper presents a dual-stream multispectral armored vehicle detection method to address this problem. First, because few datasets contain multispectral armored vehicle images, a multispectral armored vehicle detection dataset is constructed for this study. The dataset consists of 5,853 pairs of RGB and infrared images containing a total of 15,878 armored vehicle instances. Then, a cross-modal feature interaction module is designed to enable efficient feature interaction between multispectral images. This module uses a cross-modal channel-wise feature difference method to model the channel differences between the features of the two modalities and obtain a cross-modal channel difference matrix. This matrix is then used to extract the features unique to each modality, and the two modalities exchange these unique features so that each complements the other. Experimental results show that the proposed model achieves strong detection performance and copes with the various challenges posed by complex ground environments.
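The abstract does not give the module's equations, so the following is only a minimal sketch of one way a channel-wise difference could drive cross-modal feature exchange. It is not the authors' implementation. Everything in it is an assumption made for illustration: the function names (`channel_means`, `interact`), the use of global average pooling as the channel descriptor, the sigmoid gating, the additive exchange, and the reduction of the "channel difference matrix" to a per-channel vector. Plain nested lists of shape C x H x W are used so the sketch runs without dependencies.

```python
import math


def sigmoid(x):
    """Logistic function, used here as a hypothetical gating choice."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_means(feat):
    """Global average pooling: one mean per channel of a C x H x W nested list."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]


def scale(feat, weights):
    """Multiply every value in channel c by weights[c]."""
    return [[[v * w for v in row] for row in ch] for ch, w in zip(feat, weights)]


def add(a, b):
    """Element-wise sum of two feature maps with the same shape."""
    return [[[x + y for x, y in zip(ra, rb)] for ra, rb in zip(ca, cb)]
            for ca, cb in zip(a, b)]


def interact(rgb, ir):
    """Exchange modality-specific features between RGB and infrared maps.

    Assumed scheme (not taken from the paper):
      1. describe each channel by its spatial mean,
      2. take the per-channel difference RGB - IR,
      3. gate each modality by how strongly it dominates a channel,
      4. add each modality's gated features to the other modality.
    """
    diff = [r - i for r, i in zip(channel_means(rgb), channel_means(ir))]
    w_rgb = [sigmoid(d) for d in diff]   # large where RGB responds more
    w_ir = [1.0 - w for w in w_rgb]      # large where infrared responds more
    unique_rgb = scale(rgb, w_rgb)
    unique_ir = scale(ir, w_ir)
    return add(rgb, unique_ir), add(ir, unique_rgb), diff


# Toy input: channel 0 is active only in RGB, channel 1 only in infrared.
rgb = [[[1.0, 1.0], [1.0, 1.0]], [[0.0, 0.0], [0.0, 0.0]]]
ir = [[[0.0, 0.0], [0.0, 0.0]], [[1.0, 1.0], [1.0, 1.0]]]
rgb_out, ir_out, diff = interact(rgb, ir)
print(diff)
```

In this toy case each stream keeps its own active channel unchanged and receives a gated copy of the channel that was active only in the other modality, which is the complementary behavior the abstract describes in words.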

Pages: 13
