Approaching Camera-based Real-World Navigation Using Object Recognition

被引:6
|
作者
Zheng, Zejia [1 ]
He, Xie [1 ]
Weng, Juyang [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
关键词
D O I
10.1016/j.procs.2015.07.320
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Traditional autonomous navigation systems for transportation use laser range scanners to construct 3D driving scenes in terms of open and occupied voxels. Active laser range scanners suffer from a series of failures, such as inability to detect wet road surfaces, dark surfaces and objects at large distances In contrast, passive video cameras are immune from these failures but processing is challenging. High dimensionality of the input image requires efficient Big Data analytic methods for the system to perform in real-time. In this paper we argue that object recognition is essential for a navigation system to generalize learned landmarks to new driving scenes, which is a requirement for practical driving. To overcome this difficulty we present an online learning neural network for indoor navigation using only stereo cameras. The network can learn a Finite Automaton (FA) for the driving problem. Transition of the FA depends on several information sources: sensory input (stereo camera images) and motor input (i.e. object, action, GPS, and attention). Our agent simulates the transition of the FA by developing internal representation using the Developmental Network (DN) without handcrafting states or transition rules. Although the proposed network is meant for both indoor and outdoor navigation, it has been only tested in indoor environments in current work. Our experiments demonstrate the agent learned to recognize landmarks and the corresponding actions (e.g. follow the GPS input, correct current direction, and avoid obstacles). Our future work includes training and :learning in outdoor driving scenarios.
引用
收藏
页码:428 / 436
页数:9
相关论文
共 50 条
  • [21] Clickable Real World: Interaction with Real-world Landmarks using Mobile Phone Camera
    Abe, Naoyuki
    Oogami, Wataru
    Shimada, Atsushi
    Nagahara, Hajime
    Taniguchi, Rin-ichiro
    [J]. TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, : 914 - 917
  • [22] An Anytime Algorithm for Camera-Based Character Recognition
    Kobayashi, Takuya
    Iwamura, Masakazu
    Matsuda, Takahiro
    Kise, Koichi
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1140 - 1144
  • [23] Object-Based Attention in Real-World Scenes
    Malcolm, George L.
    Shomstein, Sarah
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2015, 144 (02) : 257 - 263
  • [24] Camera-based interactive wall display using hand gesture recognition
    Zahra, Rida
    Shehzadi, Afifa
    Sharif, Muhammad Imran
    Karim, Asif
    Azam, Sami
    De Boer, Friso
    Jonkman, Mirjam
    Mehmood, Mehwish
    [J]. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19
  • [25] Context mitigates crowding: Peripheral object recognition in real-world images
    Wijntjes, Maarten W. A.
    Rosenholtz, Ruth
    [J]. COGNITION, 2018, 180 : 158 - 164
  • [26] Camera-based gesture recognition for robot control
    Corradini, A
    Gross, HM
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV, 2000, : 133 - 138
  • [27] A Camera-based Real-time Polarization Sensor and Its Application to Mobile Robot Navigation
    Zhang, Shuai
    Liang, Huawei
    Zhu, Hui
    Wang, Daobin
    Yu, Biao
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 271 - 276
  • [28] Real-World ISAR Object Recognition and Relation Discovery Using Deep Relation Graph Learning
    Xue, Bin
    Tong, Ningning
    [J]. IEEE ACCESS, 2019, 7 : 43906 - 43914
  • [29] Camera-based Optical Music Recognition using a Convolutional Neural Network
    Rico, Adria
    Fornes, Alicia
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 2, 2017, : 27 - 28
  • [30] Foreground Object Detection Under Camouflage Using Multiple Camera-based Codebooks
    Malathi, T.
    Bhuyan, Manas Kamal
    [J]. 2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,