A framework for learning semantic maps from grounded natural language descriptions

被引：24

作者：

Walter, Matthew R. ^{[1
]}

Hemachandra, Sachithra ^{[1
]}

Homberg, Bianca ^{[1
]}

Tellex, Stefanie ^{[2
]}

Teller, Seth ^{[1
]}

机构：

[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2014年 / 33卷 / 09期

关键词：

Semantic mapping; Rao-Blackwellization; mapping; particle filter; natural language understanding; human-robot interaction; SIMULTANEOUS LOCALIZATION; SPACE;

D O I：

10.1177/0278364914537359

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This paper describes a framework that enables robots to efficiently learn human-centric models of their environment from natural language descriptions. Typical semantic mapping approaches are limited to augmenting metric maps with higher-level properties of the robot's surroundings (e. g. place type, object locations) that can be inferred from the robot's sensor data, but do not use this information to improve the metric map. The novelty of our algorithm lies in fusing high-level knowledge that people can uniquely provide through speech with metric information from the robot's low-level sensor streams. Our method jointly estimates a hybrid metric, topological, and semantic representation of the environment. This semantic graph provides a common framework in which we integrate information that the user communicates (e. g. labels and spatial relations) with metric observations from low-level sensors. Our algorithm efficiently maintains a factored distribution over semantic graphs based upon the stream of natural language and low-level sensor information. We detail the means by which the framework incorporates knowledge conveyed by the user's descriptions, including the ability to reason over expressions that reference yet unknown regions in the environment. We evaluate the algorithm's ability to learn human-centric maps of several different environments and analyze the knowledge inferred from language and the utility of the learned maps. The results demonstrate that the incorporation of information from free-form descriptions increases the metric, topological, and semantic accuracy of the recovered environment model.

引用

页码：1167 / 1190

页数：24

共 50 条

[1] Learning Spatial-Semantic Representations from Natural Language Descriptions and Scene Classifications
Hemachandra, Sachithra
Walter, Matthew R.
Tellex, Stefanie
Teller, Seth
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2623 - 2630
[2] Semantic Novelty Detection in Natural Language Descriptions
Ma, Nianzu
Politowicz, Alexander
Mazumder, Sahisnu
Chen, Jiahua
Liu, Bing
Robertson, Eric
Grigsby, Scott
[J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 866 - 882
[3] Learning to Read Maps: Understanding Natural Language Instructions from Unseen Maps
Katsakioris, Miltiadis Marios
Konstas, Ioannis
Mignotte, Pierre Yves
Hastie, Helen
[J]. SPLU-ROBONLP 2021: THE 2ND INTERNATIONAL COMBINED WORKSHOP ON SPATIAL LANGUAGE UNDERSTANDING AND GROUNDED COMMUNICATION FOR ROBOTICS, 2021, : 11 - 21
[4] A framework for creating natural language descriptions of video streams
Khan, Muhammad Usman Ghani
Al Harbi, Nouf
Gotoh, Yoshihiko
[J]. INFORMATION SCIENCES, 2015, 303 : 61 - 82
[5] Learning semantic sentence representations from visually grounded language without lexical knowledge
Merkx, Danny
Frank, Stefan L.
[J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (04) : 451 - 466
[6] Learning to Learn Semantic Parsers from Natural Language Supervision
Labutov, Igor
Yang, Bishan
Mitchell, Tom
[J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1676 - 1690
[7] Using Semantic Maps for Robust Natural Language Interaction with Robots
Bastianelli, Emanuele
Croce, Danilo
Basili, Roberto
Nardi, Daniele
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1393 - 1397
[8] Semantic vector learning for natural language understanding
Jung, Sangkeun
[J]. COMPUTER SPEECH AND LANGUAGE, 2019, 56 : 130 - 145
[9] Semantic processing of natural language queries in the OntoNL framework
Karanastasi, Anastasia
Christodoulakis, Stavros
[J]. ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 686 - +
[10] AN ATTEMPT AT A FRAMEWORK FOR SEMANTIC INTERPRETATION OF NATURAL-LANGUAGE
PEREGRIN, J
SGALL, P
[J]. THEORETICAL LINGUISTICS, 1986, 13 (1-2) : 37 - 73

← 1 2 3 4 5 →