LRTransDet: A Real-Time SAR Ship-Detection Network with Lightweight ViT and Multi-Scale Feature Fusion

Cited by: 10
Authors
Feng, Kunyu [1]
Lun, Li [2]
Wang, Xiaofeng [3]
Cui, Xiaoxin [2]
Affiliations
[1] Peking Univ, Sch Software & Microelectron, Beijing 102600, Peoples R China
[2] Peking Univ, Sch Integrated Circuits, Beijing 100871, Peoples R China
[3] Beijing Aerosp Automat Control Inst, Beijing 100039, Peoples R China
Keywords
synthetic aperture radar (SAR); ship detection; vision transformer (ViT); faster weighted feature fusion (Faster-WF2); coordinate attention (CA); real-time
DOI
10.3390/rs15225309
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
In recent years, deep learning techniques have substantially improved the accuracy of synthetic aperture radar (SAR) ship detection. Nonetheless, SAR images present distinct challenges, including complex backgrounds, small ship targets, and noise interference, which make detection particularly demanding. In this paper, we introduce LRTransDet, a real-time SAR ship detector. LRTransDet leverages a lightweight vision transformer (ViT) and a multi-scale feature fusion neck to address these challenges. First, our model implements a lightweight backbone that combines convolutional neural networks (CNNs) and transformers, enabling it to capture both local and global features from input SAR images. Moreover, we boost the model's efficiency by incorporating the faster weighted feature fusion (Faster-WF2) module and the coordinate attention (CA) mechanism within the feature fusion neck. These components optimize the use of computational resources while maintaining the model's performance. To overcome the challenge of detecting small ship targets in SAR images, we refine the original loss function by using the normalized Wasserstein distance (NWD) metric together with the intersection over union (IoU) scheme. This combination improves the detector's ability to detect small targets. To evaluate the proposed model, we conducted experiments on four challenging datasets (the SSDD, the SAR-Ship Dataset, the HRSID, and the LS-SSDD-v1.0). The results demonstrate that our model surpasses both general object detectors and state-of-the-art SAR ship detectors in terms of detection accuracy (97.8% on the SSDD and 93.9% on the HRSID) and speed (74.6 FPS on the SSDD and 75.8 FPS on the HRSID), while requiring only 3.07 M parameters. Additionally, we conducted a series of ablation experiments to illustrate the impact of the EfficientViT, the Faster-WF2 module, the CA mechanism, and the NWD metric on multi-scale feature fusion and detection performance.
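The abstract does not spell out how the NWD metric and IoU are combined, so the following is only a minimal sketch of the general idea, not the authors' loss. It uses the commonly cited NWD formulation, in which each box (cx, cy, w, h) is modelled as a 2D Gaussian with mean (cx, cy) and covariance diag(w²/4, h²/4). The function names, the mixing weight `alpha`, and the normalizing constant `c` are assumptions chosen for illustration.

```python
import math


def iou(box_a, box_b):
    """Plain IoU of two axis-aligned boxes given as (cx, cy, w, h)."""
    ax1, ay1 = box_a[0] - box_a[2] / 2, box_a[1] - box_a[3] / 2
    ax2, ay2 = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx1, by1 = box_b[0] - box_b[2] / 2, box_b[1] - box_b[3] / 2
    bx2, by2 = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein distance between two (cx, cy, w, h) boxes.

    `c` is a dataset-dependent normalizing constant; 12.8 is only a
    placeholder here, not a value taken from the paper.
    """
    # Squared 2-Wasserstein distance between the two Gaussians.
    w2_sq = (
        (box_a[0] - box_b[0]) ** 2
        + (box_a[1] - box_b[1]) ** 2
        + ((box_a[2] - box_b[2]) / 2) ** 2
        + ((box_a[3] - box_b[3]) / 2) ** 2
    )
    return math.exp(-math.sqrt(w2_sq) / c)


def combined_box_loss(pred, target, alpha=0.5, c=12.8):
    """Hypothetical weighted mix of an NWD loss and an IoU loss."""
    return alpha * (1.0 - nwd(pred, target, c)) + (1.0 - alpha) * (1.0 - iou(pred, target))
```

For two small boxes that do not overlap, such as `(10, 10, 4, 4)` and `(15, 10, 4, 4)`, `iou` returns zero and gives no signal about how far apart they are, whereas `nwd` still returns a value strictly between 0 and 1 that grows as the boxes move closer. This is the usual motivation for adding an NWD term when targets are small.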
Pages: 29