Efficient cross-modality feature interaction for multispectral armored vehicle detection

被引:0
|
作者
Zhang, Jie [1 ]
Chang, Tian-qing [1 ]
Zhao, Li-yang [2 ]
Ma, Jin-dun [3 ]
Han, Bin [1 ]
Zhang, Lei [1 ]
机构
[1] Army Acad Armored Forces, Beijing 100072, Peoples R China
[2] PLA Naval Submarine Acad, Qingdao 266199, Peoples R China
[3] PLA, Unit 63966, Beijing 100072, Peoples R China
关键词
Cross-modality; Armored vehicle detection; Feature interaction; Multispectral; RECOGNITION;
D O I
10.1016/j.asoc.2024.111971
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting armed vehicles from a UAV platform is challenging due to the complexity of ground environment. This paper presents a dual-stream multispectral armored vehicle detection method to tackle this problem. First, considering that there is a paucity of datasets containing multispectral armored vehicle images, a multispectral armored vehicle detection dataset is constructed for this study. The dataset consists of 5853 pairs of RGB and infrared images, featuring a total of 15,878 instances of armored vehicles. Then, a cross-modal feature interaction module is designed to enable efficient feature interaction between multispectral images. This module uses the cross-modal channel-wise feature difference method to model the channel differences between the two modal features and obtains the cross-modal channel difference matrix. The cross-modal channel difference matrix is then employed to extract the unique features of the two modal features, allowing for efficient cross-modal feature interaction by complementing each other's unique features. Experiment results demonstrate that the proposed model has excellent detection performance and is capable of coping with various challenges brought by complex ground environments.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Cross-Modality Feature Learning via Convolutional Autoencoder
    Liu, Xueliang
    Wang, Meng
    Zha, Zheng-Jun
    Hong, Richang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [22] Cross-Modality Fourier Feature for Medical Image Synthesis
    Ma, Mei
    Lin, Ling
    Wang, Heng
    Li, Zhendong
    Liu, Hao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1475 - 1480
  • [23] Cross-Modality Object Detection Based on DETR
    Huang, Xinyi
    Ma, Guochun
    IEEE ACCESS, 2025, 13 : 51220 - 51230
  • [24] Cross-modality deep feature learning for brain tumor segmentation
    Zhang, Dingwen
    Huang, Guohai
    Zhang, Qiang
    Han, Jungong
    Han, Junwei
    Yu, Yizhou
    PATTERN RECOGNITION, 2021, 110
  • [25] Cross-Modality Interaction Network for Pan-Sharpening
    Wang, Yingying
    He, Xuanhua
    Dong, Yuhang
    Lin, Yunlong
    Huang, Yue
    Ding, Xinghao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [26] Cross-modality image feature fusion diagnosis in breast cancer
    Jiang, Mingkuan
    Han, Lu
    Sun, Hang
    Li, Jing
    Bao, Nan
    Li, Hong
    Zhou, Shi
    Yu, Tao
    PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (10):
  • [27] Asymmetric cross-modality interaction network for RGB-D salient object detection
    Su, Yiming
    Gao, Haoran
    Wang, Mengyin
    Wang, Fasheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
  • [28] Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection
    Zhang, Chen
    Cong, Runmin
    Lin, Qinwei
    Ma, Lin
    Li, Feng
    Zhao, Yao
    Kwong, Sam
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2094 - 2102
  • [29] A 3D Cross-Modality Feature Interaction Network With Volumetric Feature Alignment for Brain Tumor and Tissue Segmentation
    Zhuang, Yuzhou
    Liu, Hong
    Song, Enmin
    Hung, Chih-Cheng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 75 - 86
  • [30] Cross-Modality 3D Object Detection
    Zhu, Ming
    Ma, Chao
    Ji, Pan
    Yang, Xiaokang
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3771 - 3780