Visual Recognition Based on Deep Learning for Navigation Mark Classification

被引:49
|
作者
Pan, Mingyang [1 ]
Liu, Yisai [1 ]
Cao, Jiayi [1 ]
Li, Yu [2 ]
Li, Chao [1 ]
Chen, Chi-Hua [3 ]
机构
[1] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[2] Changjiang Nanjing Waterway Bur, Nanjing 210011, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Navigation; Marine vehicles; Image recognition; Image classification; Visualization; Machine learning; Convolutional neural networks; Deep learning; image classification; multi-scale attention; navigation marks; ResNet; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2020.2973856
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing objects from camera images is an important field for researching smart ships and intelligent navigation. In sea transportation, navigation marks indicating the features of navigational environments (e.g. channels, special areas, wrecks, etc.) are focused in this paper. A fine-grained classification model named RMA (ResNet-Multiscale-Attention) based on deep learning is proposed to analyse the subtle and local differences among navigation mark types for the recognition of navigation marks. In the RMA model, an attention mechanism based on the fusion of feature maps with three scales is proposed to locate attention regions and capture discriminative characters that are important to distinguish the slight differences among similar navigation marks. Experimental results on a dataset with 10260 navigation mark images showed that the RMA has an accuracy about 96 & x0025; to classify 42 types of navigation marks, and the RMA is better than ResNet-50 model with which the accuracy is about 94 & x0025;. The visualization analyses showed that the RMA model can extract the attention regions and the characters of navigation marks.
引用
收藏
页码:32767 / 32775
页数:9
相关论文
共 50 条
  • [1] Door recognition and deep learning algorithm for visual based robot navigation
    Chen, Wei
    Qu, Ting
    Zhou, Yimin
    Weng, Kaijian
    Wang, Gang
    Fu, Guoqiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 1793 - 1798
  • [2] Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning
    Han, Xu
    Pan, Mingyang
    Ge, Haipeng
    Li, Shaoxi
    Hu, Jingfeng
    Zhao, Lining
    Li, Yu
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [3] Research on Road Sign Recognition of Visual Navigation Vehicle Based on the YOLO Deep Learning Algorithm
    Shi, Zhenjiang
    Lu, Fei
    Liu, Yingqun
    Huang, Haojing
    Zheng, Jiayi
    Chen, Xiting
    Li, Jiangtao
    Lin, Yongsheng
    Zhu, Anxia
    Ke, Lin
    Chen, Zhinan
    Chen, Xuhua
    Yang, Zhenwang
    Wu, Wenxin
    PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2023, 2023, : 139 - 144
  • [4] Design of a Smart Navigation Mark Light System Based on Deep Learning
    Wu, Yiheng
    XinLi
    Yin, Zhangjie
    Li, Jian
    Liu, Chun
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING, INDUSTRIAL MATERIALS AND INDUSTRIAL ELECTRONICS (MEIMIE 2019), 2019, : 29 - 36
  • [5] Texture Recognition and Classification Based on Deep Learning
    Zhu, Gaoming
    Li, Bingchan
    Hong, Shuai
    Mao, Bo
    2018 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2018, : 344 - 348
  • [6] Visual Navigation With Multiple Goals Based on Deep Reinforcement Learning
    Rao, Zhenhuan
    Wu, Yuechen
    Yang, Zifei
    Zhang, Wei
    Lu, Shijian
    Lu, Weizhi
    Zha, ZhengJun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5445 - 5455
  • [7] Distributed Deep Reinforcement Learning based Indoor Visual Navigation
    Hsu, Shih-Hsi
    Chan, Shoo-Hung
    Wu, Ping-Tsang
    Xiao, Kun
    Fu, Li-Chen
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 2532 - 2537
  • [8] Deep learning based decomposition for visual navigation in industrial platforms
    Youcef Djenouri
    Johan Hatleskog
    Jon Hjelmervik
    Elias Bjorne
    Trygve Utstumo
    Milad Mobarhan
    Applied Intelligence, 2022, 52 : 8101 - 8117
  • [9] Deep learning based decomposition for visual navigation in industrial platforms
    Djenouri, Youcef
    Hatleskog, Johan
    Hjelmervik, Jon
    Bjorne, Elias
    Utstumo, Trygve
    Mobarhan, Milad
    APPLIED INTELLIGENCE, 2022, 52 (07) : 8101 - 8117
  • [10] A Survey of Visual Affordance Recognition Based on Deep Learning
    Chen, Dongpan
    Kong, Dehui
    Li, Jinghua
    Wang, Shaofan
    Yin, Baocai
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (06) : 1458 - 1476