Hybrid Multiscale SAR Ship Detector With CNN-Transformer and Adaptive Fusion Loss

被引:0
|
作者
Wang, Fei [1 ]
Chen, Chengcheng [1 ]
Zeng, Weiming [1 ]
机构
[1] Shanghai Maritime Univ, Digital Imaging & Intelligent Comp Lab, Shanghai 201306, Peoples R China
关键词
Marine vehicles; Feature extraction; Detectors; Convolution; Transformers; Computational modeling; Synthetic aperture radar; Deep learning; multiscale feature fusion; ship detection; synthetic aperture radar (SAR);
D O I
10.1109/LGRS.2024.3450716
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Ship detection in remote sensing imagery is crucial for various maritime applications such as surveillance and navigation. Convolutional neural networks (CNNs) and transformers have shown significant potential in object detection within the field of image processing. However, existing models applied directly to ship detection in synthetic aperture radar (SAR) imagery encounter challenges due to the varying sizes of ship targets. This often leads to issues such as low detection accuracy, missed detections, and false alarms. In this letter, we propose a new detection network, HMA-Net, to further address these issues. Initially, we introduce the Cwin module, which enhances interference resistance at a relatively low cost, enabling the model to more accurately capture target information. Subsequently, we design a multiscale ship feature extraction module, which uses a parallel multibranch structure to extract features of ships of various sizes and shapes. Finally, we introduce an adaptive fusion loss function that flexibly allocates loss calculation methods to detected targets, thereby enhancing the robustness of the model and achieving high-quality detection boxes. The proposed HMA-Net achieved improvements of 2.0% and 0.9% in mAP(.50:.95) over the baseline models on the SAR Ship Detection dataset and the High-Resolution SAR Images dataset, using only 3.52 M parameters.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] A hybrid CNN-Transformer model for Historical Document Image Binarization
    Rezanezhad, Vahid
    Baierer, Konstantin
    Neudecker, Clemens
    PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, : 79 - 84
  • [22] Harmful Cyanobacterial Blooms forecasting based on improved CNN-Transformer and Temporal Fusion Transformer
    Ahn, Jung Min
    Kim, Jungwook
    Kim, Hongtae
    Kim, Kyunghyun
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2023, 32
  • [23] DBCT-Net:A dual branch hybrid CNN-transformer network for remote sensing image fusion
    Wang, Quanli
    Jin, Xin
    Jiang, Qian
    Wu, Liwen
    Zhang, Yunchun
    Zhou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [24] A CNN-Transformer Combined Remote Sensing Imagery Spatiotemporal Fusion Model
    Jiang, Mingyu
    Shao, Hua
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 13995 - 14009
  • [25] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
    Wang, Hongmei
    Li, Lin
    Li, Chenkai
    Lu, Xuanyu
    IEEE ACCESS, 2023, 11 : 78956 - 78969
  • [26] A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection
    Lu, Wanjie
    Lan, Chaozhen
    Niu, Chaoyang
    Liu, Wei
    Lyu, Liang
    Shi, Qunshan
    Wang, Shiju
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1211 - 1231
  • [27] A novel hybrid CNN-Transformer model for EEG Motor Imagery classification
    Ma, Yaxin
    Song, Yonghao
    Gao, Fei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [28] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
    He, Qiqi
    Yang, Qiuju
    Xie, Minghao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [29] A Hybrid CNN-Transformer Architecture for Semantic Segmentation of Radar Sounder data
    Ghosh, Raktim
    Bovolo, Francesca
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1320 - 1323
  • [30] CTMANet: A CNN-Transformer Hybrid Semantic Segmentation Network for Fine-Grained Airport Extraction in Complex SAR Scenes
    Wu, Keyu
    Cai, Feng
    Wang, Haipeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4689 - 4704