Visual Recognition Based on Deep Learning for Navigation Mark Classification

被引:49
|
作者
Pan, Mingyang [1 ]
Liu, Yisai [1 ]
Cao, Jiayi [1 ]
Li, Yu [2 ]
Li, Chao [1 ]
Chen, Chi-Hua [3 ]
机构
[1] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[2] Changjiang Nanjing Waterway Bur, Nanjing 210011, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Navigation; Marine vehicles; Image recognition; Image classification; Visualization; Machine learning; Convolutional neural networks; Deep learning; image classification; multi-scale attention; navigation marks; ResNet; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2020.2973856
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing objects from camera images is an important field for researching smart ships and intelligent navigation. In sea transportation, navigation marks indicating the features of navigational environments (e.g. channels, special areas, wrecks, etc.) are focused in this paper. A fine-grained classification model named RMA (ResNet-Multiscale-Attention) based on deep learning is proposed to analyse the subtle and local differences among navigation mark types for the recognition of navigation marks. In the RMA model, an attention mechanism based on the fusion of feature maps with three scales is proposed to locate attention regions and capture discriminative characters that are important to distinguish the slight differences among similar navigation marks. Experimental results on a dataset with 10260 navigation mark images showed that the RMA has an accuracy about 96 & x0025; to classify 42 types of navigation marks, and the RMA is better than ResNet-50 model with which the accuracy is about 94 & x0025;. The visualization analyses showed that the RMA model can extract the attention regions and the characters of navigation marks.
引用
收藏
页码:32767 / 32775
页数:9
相关论文
共 50 条
  • [31] DeepNAVI: A deep learning based smartphone navigation assistant for people with visual impairments
    Kuriakose, Bineeth
    Shrestha, Raju
    Sandnes, Frode Eika
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [32] Indoor Localization and Navigation based on Deep Learning using a Monocular Visual System
    Arevalo Ancona, Rodrigo Eduardo
    Corona Ramirez, Leonel German
    Gutierrez Frias, Oscar Octavio
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (06) : 79 - 86
  • [33] Forward Collision Warning and Lane-mark Recognition Systems Based on Deep Learning
    Pai, Neng-Sheng
    Huang, Jing-Bin
    Wu, Jian-Xing
    Chen, Pi-Yun
    Zhou, Yue-Han
    SENSORS AND MATERIALS, 2020, 32 (06) : 1981 - 1995
  • [34] Deep Reinforcement Learning for Visual Semantic Navigation with Memory
    de Andrade Santos, Iury Batista
    Romero, Roseli A. F.
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 114 - 119
  • [35] Deep Visual MPC-Policy Learning for Navigation
    Hirose, Noriaki
    Xia, Fei
    Martin-Martin, Roberto
    Sadeghian, Amir
    Savarese, Silvio
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 3184 - 3191
  • [36] Deep Learning for Visual Navigation of Unmanned Ground Vehicles
    Mahony, Niall O'
    Campbell, Sean
    Krpalkova, Lenka
    Riordan, Daniel
    Walsh, Joseph
    Murphy, Aidan
    Ryan, Conor
    2018 29TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2018,
  • [37] Infrared Image Recognition Technology Based on Visual Processing and Deep Learning
    He Feng
    Hu Xuran
    Liu Bin
    Wang Haipeng
    Zhang Decai
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 641 - 645
  • [38] Deep Learning-Based Approach for Arabic Visual Speech Recognition
    Alsulami, Nadia H.
    Jamal, Amani T.
    Elrefaei, Lamiaa A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 85 - 108
  • [39] OMRNet: A lightweight deep learning model for optical mark recognition
    Mondal, Sayan
    De, Pratyay
    Malakar, Samir
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14011 - 14045
  • [40] Video Deep Learning Classification for Autonomous Vehicle Navigation
    Salem, Fathi M.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1014 - 1017