An efficient approach to detect and segment underwater images using Swin Transformer

被引:2
|
作者
Pavithra, S. [1 ]
Denny, J. Cicil Melbin [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Artificial Intelligence; Attention mechanisms; ConvMixer; Swin Transformer; Underwater environments;
D O I
10.1016/j.rineng.2024.102460
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In managing the fish environment and underwater ecosystem, there 's been a big shift from old-generational schoolings, hands-on methods to high-technologies, automatic ways of doing things, because of some clever computer technology, using a type of Artificial Intelligence smartness known as Convolutional Neural Networks, or CNNs in short. These smart systems help deal with the tricky task of keeping tabs on where fish stay out in deeper underwater, particularly when it 's hard to see underwater and when the fish are all over the place. There are some pretty good tools and techniques already out there, like DPANet and a few others with names like PIFSNet, EFS -Net, and MFAS-Net, which have been helping a bunch of researches and developments for underwater ecosystem. But Deeper water environment still needs a newer and better way to get the tasks done. So, to answer this major question, this hybrid architecture rolls out a new combination that takes the best bits of transformer architecture known as Swin Transformer and another mechanism known as ConvMixer, making it in a form of SwinUNet Architecture. This Swin Transformer is well defined at picking up on the spatial information from underwater deeper images in our case, which is a key point to be highlighted in really getting the fish 's niche turf and enquiring their ecosystem. And when the underwater turns misty, the specialized implementation of ConvMixer modules incorporated with Swin Transformer increases its ability to sort out the fish from the rest of the underwater world. And this makes it to perform even more better compared with the existing state of arts model, even under Modern Underwater Machine Vision System conditions. This model even over takes to utilization of Few-Shot Learning model for training it in this lesser Underwater Images, with its capability of identifying the patterns and differentiations in both larger and smaller class objects in the given ground truth image. In this new approach, the paper combines the smart focusing methods with a way to blend different signals to better mIoU score, making it super useful for having a deeper understanding on fish-ecosystem and also their underwater ecosystem. While comparing it with the pre-existing state of arts model and the booming Image Segmentation models (YOLO v8, Pix 2 Pix GAN model, Auto Encoders, UNet), the hybrid approach outperformed the existing model results on this Semantic Segmentation for Underwater Imagery (SUIM) dataset, which is been disclosed in the further sections below.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A computational approach to detect and segment cytoplasm in muscle fiber images
    Guo, Yanen
    Xu, Xiaoyin
    Wang, Yuanyuan
    Yang, Zhong
    Wang, Yaming
    Xia, Shunren
    MICROSCOPY RESEARCH AND TECHNIQUE, 2015, 78 (06) : 508 - 518
  • [22] SWAT: An Efficient Swin Transformer Accelerator Based on FPGA
    Dong, Qiwei
    Xie, Xiaoru
    Wang, Zhongfeng
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 515 - 520
  • [23] ASS-CD: Adapting Segment Anything Model and Swin-Transformer for Change Detection in Remote Sensing Images
    Wei, Chenlong
    Wu, Xiaofeng
    Wang, Bin
    REMOTE SENSING, 2025, 17 (03)
  • [24] HYPERSPECTRAL AND MULTISPECTRAL IMAGES FUSION BASED ON PYRAMID SWIN TRANSFORMER
    Lang, Han
    Bao, Wenxing
    Feng, Wei
    Sun, Shasha
    Ma, Xuan
    Zhang, Xiaowu
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 3125 - 3128
  • [25] Hyperspectral and multispectral images fusion based on pyramid swin transformer
    Lang, Han
    Bao, Wenxing
    Feng, Wei
    Qu, Kewen
    Ma, Xuan
    Zhang, Xiaowu
    INFRARED PHYSICS & TECHNOLOGY, 2024, 143
  • [26] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [27] Automated efficient traffic gesture recognition using swin transformer-based multi-input deep network with radar images
    Firat, Huseyin
    Uzen, Huseyin
    Atila, Orhan
    Sengur, Abdulkadir
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [28] A Swin Transformer, YOLO, and Weighted Boxes Fusion-Based Approach for Tree Detection in Satellite Images
    Durgut, Ozan
    Unsalan, Cem
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [29] REMOTE SENSING IMAGES CHANGE DETECTION USING THE SIAMESE NETWORK COMBINED WITH PURE SWIN TRANSFORMER
    Song, Xu
    Tong, Xinyu
    Hajamydeen, Asif Iqbal
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2024, 2024 (04): : 241 - 252
  • [30] BMO based improved Lite Swin transformer for brain tumor detection using MRI images
    Gade, Venkata Subba Reddy
    Cherian, Reni K.
    Rajarao, Bhupathi
    Kumar, Madam Aravind
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92