Visuomotor Navigation for Embodied Robots With Spatial Memory and Semantic Reasoning Cognition

Cited by: 1

Authors:

Liu, Qiming [1]

Wang, Guangzhan [2]

Liu, Zhe [3]

Wang, Hesheng [4]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Software, Shanghai 200240, Peoples R China

[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[4] Shanghai Jiao Tong Univ, Minist Educ, Key Lab Marine Intelligent Equipment & Syst, Shanghai Engn Res Ctr Intelligent Control & Manage, Shanghai 200240, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024

Keywords:

Cognition; Semantics; Navigation; Robots; Decision making; Pipelines; Visualization; Cognitive ability; semantic reasoning; topological memory; visuomotor navigation;

DOI:

10.1109/TNNLS.2024.3418857

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The fundamental prerequisite for embodied agents to make intelligent decisions lies in autonomous cognition. Typically, agents optimize decision-making by leveraging extensive spatiotemporal information from episodic memory. Concurrently, they utilize long-term experience for task reasoning and foster conscious behavioral tendencies. However, due to the significant disparities in the heterogeneous modalities of these two cognitive abilities, existing literature falls short in designing effective coupling mechanisms, thus failing to endow robots with comprehensive intelligence. This article introduces a navigation framework, the hierarchical topology-semantic cognitive navigation (HTSCN), which seamlessly integrates both memory and reasoning abilities within a singular end-to-end system. Specifically, we represent memory and reasoning abilities with a topological map and a semantic relation graph, respectively, within a unified dual-layer graph structure. Additionally, we incorporate a neural-based cognition extraction process to capture cross-modal relationships between hierarchical graphs. HTSCN forges a link between two different cognitive modalities, thus further enhancing decision-making performance and the overall level of intelligence. Experimental results demonstrate that in comparison to existing cognitive structures, HTSCN significantly enhances the performance and path efficiency of image-goal navigation. Visualization and interpretability experiments further corroborate the promoting role of memory, reasoning, as well as their online learned relationships, on intelligent behavioral patterns. Furthermore, we deploy HTSCN in real-world scenarios to further verify its feasibility and adaptability.

Pages: 12

50 items in total

[31] Emergence and development of embodied cognition: a constructivist approach using robots
Kuniyoshi, Yasuo
Yorozu, Yasuaki
Suzuki, Shinsuke
Sangawa, Shinji
Ohmura, Yoshiyuki
Terada, Koji
Nagakubo, Akihiko
FROM ACTION TO COGNITION, 2007, 164 : 425 - 445
[32] Contributions of Spatial Working Memory to Visuomotor Learning
Anguera, Joaquin A.
Reuter-Lorenz, Patricia A.
Willingham, Daniel T.
Seidler, Rachael D.
JOURNAL OF COGNITIVE NEUROSCIENCE, 2010, 22 (09) : 1917 - 1930
[33] Immersion as an embodied cognition shift: aesthetic experience and spatial situated cognition
Bruno Trentini
Cognitive Processing, 2015, 16 : 413 - 416
[34] Immersion as an embodied cognition shift: aesthetic experience and spatial situated cognition
Trentini, Bruno
COGNITIVE PROCESSING, 2015, 16 : S413 - S416
[35] Immersion as an embodied cognition shift: Aesthetic experience and spatial situated cognition
Trentini, Bruno
COGNITIVE PROCESSING, 2015, 16 : S44 - S44
[36] Spatial Cognition, Navigation, and Environmental Knowledge
Denis, Michel
Tversky, Barbara
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2016, 51 : 161 - 161
[37] Language and Spatial Cognition in Route Navigation
宋娟
海外英语, 2013, (13) : 281 - 282
[38] Reasoning on spatial semantic integrity constraints
Maes, Stephan
SPATIAL INFORMATION THEORY, PROCEEDINGS, 2007, 4736 : 285 - 302
[39] Rapid training of supramodal spatial cognition and memory for improved navigation in low vision and blindness
Likova, Lora
Mineff, Kristyo
Nicholas, Spero
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2020, 61 (07)
[40] Semantic reasoning in service robots using expert systems
Savage, Jesus
Rosenblueth, David A.
Matamoros, Mauricio
Negrete, Marco
Contreras, Luis
Cruz, Julio
Martell, Reynaldo
Estrada, Hugo
Okada, Hiroyuki
ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 114 : 77 - 92