Self-Supervised Underwater Image Generation for Underwater Domain Pre-Training

被引:0
|
作者
Wu, Zhiheng [1 ,2 ]
Wu, Zhengxing [1 ,2 ]
Chen, Xingyu [3 ]
Lu, Yue [1 ,2 ]
Yu, Junzhi [3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Lab Cognit & Decis Intelligence Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Peking Univ, Coll Engn, Dept Adv Mfg & Robot, State Key Lab Turbulence & Complex Syst,BIC ESAT, Beijing 100871, Peoples R China
基金
北京市自然科学基金;
关键词
Object detection; pre-training; self-supervised learning; semantic segmentation; underwater image generation;
D O I
10.1109/TIM.2024.3373105
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The rapid progress in computer vision has presented new opportunities for enhancing the visual capabilities of underwater robots. However, most deep learning-based visual perception algorithms often underperform due to the scarcity of underwater datasets. To address this issue, we propose an underwater image synthesis method for pre-training in the underwater domain. By leveraging self-supervised learning, we simulate the physical imaging process of underwater scenes, allowing for style transfer from in-air images to underwater images using a reduced amount of underwater data. Furthermore, we propose a pre-training strategy that utilizes synthetic underwater images to enhance underwater visual perception. Finally, abundant experiments are conducted, including quantitative and qualitative comparisons. The results validate the effectiveness and superiority of the proposed underwater image synthesis method, highlighting the substantial improvement in underwater environment perception achieved through the underwater domain pre-training (UDP) strategy.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Reducing Domain mismatch in Self-supervised speech pre-training
    Baskar, Murali Karthick
    Rosenberg, Andrew
    Ramabhadran, Bhuvana
    Zhang, Yu
    [J]. INTERSPEECH 2022, 2022, : 3028 - 3032
  • [2] CDS: Cross-Domain Self-supervised Pre-training
    Kim, Donghyun
    Saito, Kuniaki
    Oh, Tae-Hyun
    Plummer, Bryan A.
    Sclaroff, Stan
    Saenko, Kate
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9103 - 9112
  • [3] MEASURING THE IMPACT OF DOMAIN FACTORS IN SELF-SUPERVISED PRE-TRAINING
    Sanabria, Ramon
    Wei-Ning, Hsu
    Alexei, Baevski
    Auli, Michael
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [4] Self-supervised ECG pre-training
    Liu, Han
    Zhao, Zhenbo
    She, Qiang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 70
  • [5] Self-Supervised Pre-Training Joint Framework: Assisting Lightweight Detection Network for Underwater Object Detection
    Wang, Zhuo
    Chen, Haojie
    Qin, Hongde
    Chen, Qin
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [6] DiT: Self-supervised Pre-training for Document Image Transformer
    Li, Junlong
    Xu, Yiheng
    Lv, Tengchao
    Cui, Lei
    Zhang, Cha
    Wei, Furu
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3530 - 3539
  • [7] Correlational Image Modeling for Self-Supervised Visual Pre-Training
    Li, Wei
    Xie, Jiahao
    Loy, Chen Change
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15105 - 15115
  • [8] Self-supervised Pre-training for Mirror Detection
    Lin, Jiaying
    Lau, Rynson W. H.
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12193 - 12202
  • [9] Self-supervised Pre-training for Nuclei Segmentation
    Haq, Mohammad Minhazul
    Huang, Junzhou
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 303 - 313
  • [10] EFFECTIVENESS OF SELF-SUPERVISED PRE-TRAINING FOR ASR
    Baevski, Alexei
    Mohamed, Abdelrahman
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7694 - 7698