Visuomotor Navigation for Embodied Robots With Spatial Memory and Semantic Reasoning Cognition

被引:1
|
作者
Liu, Qiming [1 ]
Wang, Guangzhan [2 ]
Liu, Zhe [3 ]
Wang, Hesheng [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Software, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[4] Shanghai Jiao Tong Univ, Minist Educ, Key Lab Marine Intelligent Equipment & Syst, Shanghai Engn Res Ctr Intelligent Control & Manage, Shanghai 200240, Peoples R China
关键词
Cognition; Semantics; Navigation; Robots; Decision making; Pipelines; Visualization; Cognitive ability; semantic reasoning; topological memory; visuomotor navigation;
D O I
10.1109/TNNLS.2024.3418857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fundamental prerequisite for embodied agents to make intelligent decisions lies in autonomous cognition. Typically, agents optimize decision-making by leveraging extensive spatiotemporal information from episodic memory. Concurrently, they utilize long-term experience for task reasoning and foster conscious behavioral tendencies. However, due to the significant disparities in the heterogeneous modalities of these two cognitive abilities, existing literature falls short in designing effective coupling mechanisms, thus failing to endow robots with comprehensive intelligence. This article introduces a navigation framework, the hierarchical topology-semantic cognitive navigation (HTSCN), which seamlessly integrates both memory and reasoning abilities within a singular end-to-end system. Specifically, we represent memory and reasoning abilities with a topological map and a semantic relation graph, respectively, within a unified dual-layer graph structure. Additionally, we incorporate a neural-based cognition extraction process to capture cross-modal relationships between hierarchical graphs. HTSCN forges a link between two different cognitive modalities, thus further enhancing decision-making performance and the overall level of intelligence. Experimental results demonstrate that in comparison to existing cognitive structures, HTSCN significantly enhances the performance and path efficiency of image-goal navigation. Visualization and interpretability experiments further corroborate the promoting role of memory, reasoning, as well as their online learned relationships, on intelligent behavioral patterns. Furthermore, we deploy HTSCN in real-world scenarios to further verify its feasibility and adaptability.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Emergence and development of embodied cognition: a constructivist approach using robots
    Kuniyoshi, Yasuo
    Yorozu, Yasuaki
    Suzuki, Shinsuke
    Sangawa, Shinji
    Ohmura, Yoshiyuki
    Terada, Koji
    Nagakubo, Akihiko
    FROM ACTION TO COGNITION, 2007, 164 : 425 - 445
  • [32] Contributions of Spatial Working Memory to Visuomotor Learning
    Anguera, Joaquin A.
    Reuter-Lorenz, Patricia A.
    Willingham, Daniel T.
    Seidler, Rachael D.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2010, 22 (09) : 1917 - 1930
  • [33] Immersion as an embodied cognition shift: aesthetic experience and spatial situated cognition
    Bruno Trentini
    Cognitive Processing, 2015, 16 : 413 - 416
  • [34] Immersion as an embodied cognition shift: aesthetic experience and spatial situated cognition
    Trentini, Bruno
    COGNITIVE PROCESSING, 2015, 16 : S413 - S416
  • [35] Immersion as an embodied cognition shift: Aesthetic experience and spatial situated cognition
    Trentini, Bruno
    COGNITIVE PROCESSING, 2015, 16 : S44 - S44
  • [36] Spatial Cognition, Navigation, and Environmental Knowledge
    Denis, Michel
    Tversky, Barbara
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2016, 51 : 161 - 161
  • [37] Language and Spatial Cognition in Route Navigation
    宋娟
    海外英语, 2013, (13) : 281 - 282
  • [38] Reasoning on spatial semantic integrity constraints
    Maes, Stephan
    SPATIAL INFORMATION THEORY, PROCEEDINGS, 2007, 4736 : 285 - 302
  • [39] Rapid training of supramodal spatial cognition and memory for improved navigation in low vision and blindness
    Likova, Lora
    Mineff, Kristyo
    Nicholas, Spero
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2020, 61 (07)
  • [40] Semantic reasoning in service robots using expert systems
    Savage, Jesus
    Rosenblueth, David A.
    Matamoros, Mauricio
    Negrete, Marco
    Contreras, Luis
    Cruz, Julio
    Martell, Reynaldo
    Estrada, Hugo
    Okada, Hiroyuki
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 114 : 77 - 92