Efficient cross-modality feature interaction for multispectral armored vehicle detection

被引:0
|
作者
Zhang, Jie [1 ]
Chang, Tian-qing [1 ]
Zhao, Li-yang [2 ]
Ma, Jin-dun [3 ]
Han, Bin [1 ]
Zhang, Lei [1 ]
机构
[1] Army Acad Armored Forces, Beijing 100072, Peoples R China
[2] PLA Naval Submarine Acad, Qingdao 266199, Peoples R China
[3] PLA, Unit 63966, Beijing 100072, Peoples R China
关键词
Cross-modality; Armored vehicle detection; Feature interaction; Multispectral; RECOGNITION;
D O I
10.1016/j.asoc.2024.111971
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting armed vehicles from a UAV platform is challenging due to the complexity of ground environment. This paper presents a dual-stream multispectral armored vehicle detection method to tackle this problem. First, considering that there is a paucity of datasets containing multispectral armored vehicle images, a multispectral armored vehicle detection dataset is constructed for this study. The dataset consists of 5853 pairs of RGB and infrared images, featuring a total of 15,878 instances of armored vehicles. Then, a cross-modal feature interaction module is designed to enable efficient feature interaction between multispectral images. This module uses the cross-modal channel-wise feature difference method to model the channel differences between the two modal features and obtains the cross-modal channel difference matrix. The cross-modal channel difference matrix is then employed to extract the unique features of the two modal features, allowing for efficient cross-modal feature interaction by complementing each other's unique features. Experiment results demonstrate that the proposed model has excellent detection performance and is capable of coping with various challenges brought by complex ground environments.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Partial Unbalanced Feature Transport for Cross-Modality Cardiac Image Segmentation
    Dong, Shunjie
    Pan, Zixuan
    Fu, Yu
    Xu, Dongwei
    Shi, Kuangyu
    Yang, Qianqian
    Shi, Yiyu
    Zhuo, Cheng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (06) : 1758 - 1773
  • [32] Hierarchical Feature Fusion for Cross-Modality Person Re-identification
    Fu, Wen
    Lim, Monghao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (16)
  • [33] Transformer-Based Visual Grounding with Cross-Modality Interaction
    Li, Kun
    Li, Jiaxiu
    Guo, Dan
    Yang, Xun
    Wang, Meng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [34] Dynamic feature weakening for cross-modality person re-identification*
    Lu, Jian
    Chen, Mengdie
    Wang, Hangying
    Pang, Feifei
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 109
  • [35] Cross-Modality Interaction-Based Traffic Accident Classification
    Oh, Changhyeon
    Ban, Yuseok
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [36] ContextMatcher: Detector-Free Feature Matching With Cross-Modality Context
    Li, Dongyue
    Du, Songlin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7922 - 7934
  • [37] Cross-Modality Proposal-Guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection
    Tian, Chao
    Zhou, Zikun
    Huang, Yuqing
    Li, Gaojun
    He, Zhenyu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6449 - 6461
  • [38] CAFCNet: Cross-modality asymmetric feature complement network for RGB-T salient object detection
    Jin, Dongze
    Shao, Feng
    Xie, Zhengxuan
    Mu, Baoyang
    Chen, Hangwei
    Jiang, Qiuping
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [39] Enhancing Lidar and Radar Fusion for Vehicle Detection in Adverse Weather via Cross-Modality Semantic Consistency
    Du, Yu
    Yang, Ting
    Chang, Qiong
    Zhong, Wei
    Wang, Weimin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 439 - 451
  • [40] MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification
    Chen, Qingshan
    Zhang, Moyan
    Quan, Zhenzhen
    Zhang, Yumeng
    Mozerov, Mikhail G.
    Zhai, Chao
    Li, Hongjuan
    Li, Yujun
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,