Fine-grained ship image classification and detection based on a vision transformer and multi-grain feature vector FPN model

Cited by: 2
Authors
Wang, Fengxiang [1]
Yu, Deying [2]
Huang, Liang [3]
Zhang, Yalun [4]
Chen, Yongbing [2]
Wang, Zhiguo [5]
Affiliations
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha, Peoples R China
[2] Naval Univ Engn, Sch Elect Engn, Wuhan, Peoples R China
[3] Naval Univ Engn, Coll Elect Engn, Wuhan, Peoples R China
[4] Peoples Liberat Army Naval Command Coll, Combat Command Dept, Nanjing, Peoples R China
[5] Naval Univ Engn, Dept Operat Res & Planning, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; image classification; ship detection; remote-sensing images; transformer; network
DOI
10.1080/10095020.2024.2331552
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Subject classification codes
081102; 0816; 081602; 083002; 1404
Abstract
In naval and civilian domains, meticulous ship classification and detection are paramount. Nevertheless, predominant research has gravitated toward leveraging Convolutional Neural Network (CNN)-centered methodologies, often overlooking the diverse granularity inherent in ship samples. In our pursuit to holistically extract features from ship images across varying granularities, we present a transformative architecture: the Vision Transformer and Multi-Grain Feature Vector Feature Pyramid Network (ViT-MGFV-FPN). This model synergistically melds the merits of MGFV-FPN with an augmented Vision Transformer (ViT) for a comprehensive image feature extraction. To cater to the extraction of broader image features whilst sidestepping the innate quadratic complexity of traditional ViT, we unveil an enhanced version christened the Global Swin Transformer. Concurrently, the MGFV-FPN is orchestrated to harness the prowess of CNNs in distilling intricate ship attributes. Rigorous empirical evaluations underscore our model's superiority in juxtaposition with extant CNN and transformer-based paradigms for nuanced ship categorization.
Pages: 22
Related papers
50 in total
  • [21] Fine-Grained Image Classification for Pollen Grain Microscope Images
    Trenta, Francesca
    Ortis, Alessandro
    Battiato, Sebastiano
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2021, PT 1, 2021, 13052 : 341 - 351
  • [22] Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification
    Ji, Ruyi
    Li, Jiaying
    Zhang, Libo
    Liu, Jing
    Wu, Yanjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5009 - 5021
  • [23] Fine-Grained Clothing Image Classification by Style Feature Description
    Wu M.
    Liu L.
    Fu X.
    Liu L.
    Huang Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (05): : 780 - 791
  • [24] Coordinate feature fusion networks for fine-grained image classification
    Kaiyang Liao
    Gang Huang
    Yuanlin Zheng
    Guangfeng Lin
    Congjun Cao
    Signal, Image and Video Processing, 2023, 17 : 807 - 815
  • [25] Coordinate feature fusion networks for fine-grained image classification
    Liao, Kaiyang
    Huang, Gang
    Zheng, Yuanlin
    Lin, Guangfeng
    Cao, Congjun
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (03) : 807 - 815
  • [26] Learning Semantically Enhanced Feature for Fine-Grained Image Classification
    Luo, Wei
    Zhang, Hengmin
    Li, Jun
    Wei, Xiu-Shen
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 1545 - 1549
  • [27] Fine-Grained Image Classification Combining Swin and Multi-Scale Feature Fusion
    Xiang, Jianwen
    Chen, Minrong
    Yang, Baibing
    Computer Engineering and Applications, 2023, 59 (20): : 147 - 157
  • [28] Fine-grained visual classification based on compact Vision transformer
    Xu H.
    Guo L.
    Li R.-Z.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 893 - 900
  • [29] Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained Ship Classification
    Lan, Long
    Wang, Fengxiang
    Zheng, Xiangtao
    Wang, Zengmao
    Liu, Xinwang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [30] Application of Image Classification for Fine-Grained Nudity Detection
    Ion, Cristian
    Minea, Cristian
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 3 - 15