A framework for learning semantic maps from grounded natural language descriptions

Cited by: 24
Authors
Walter, Matthew R. [1]
Hemachandra, Sachithra [1]
Homberg, Bianca [1]
Tellex, Stefanie [2]
Teller, Seth [1]
Affiliations
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
Source
Keywords
Semantic mapping; Rao-Blackwellization; mapping; particle filter; natural language understanding; human-robot interaction; SIMULTANEOUS LOCALIZATION; SPACE;
DOI
10.1177/0278364914537359
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
This paper describes a framework that enables robots to efficiently learn human-centric models of their environment from natural language descriptions. Typical semantic mapping approaches are limited to augmenting metric maps with higher-level properties of the robot's surroundings (e.g., place type, object locations) that can be inferred from the robot's sensor data, but do not use this information to improve the metric map. The novelty of our algorithm lies in fusing high-level knowledge that people can uniquely provide through speech with metric information from the robot's low-level sensor streams. Our method jointly estimates a hybrid metric, topological, and semantic representation of the environment. This semantic graph provides a common framework in which we integrate information that the user communicates (e.g., labels and spatial relations) with metric observations from low-level sensors. Our algorithm efficiently maintains a factored distribution over semantic graphs based upon the stream of natural language and low-level sensor information. We detail the means by which the framework incorporates knowledge conveyed by the user's descriptions, including the ability to reason over expressions that reference as-yet-unknown regions in the environment. We evaluate the algorithm's ability to learn human-centric maps of several different environments and analyze the knowledge inferred from language and the utility of the learned maps. The results demonstrate that the incorporation of information from free-form descriptions increases the metric, topological, and semantic accuracy of the recovered environment model.
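The abstract mentions a factored distribution over semantic graphs, and the keywords name Rao-Blackwellization and particle filtering. As a rough illustration only, and not the authors' implementation, the toy sketch below shows the general Rao-Blackwellized pattern: each particle samples a discrete topology while the label distribution of each region is updated in closed form. The label set, the `Particle` class, the function names, and the 0.8 accuracy value are all assumptions made for this example.

```python
import random
from dataclasses import dataclass

LABELS = ["kitchen", "hallway", "office"]  # hypothetical label set

@dataclass
class Particle:
    edges: frozenset    # sampled part: topology as a set of (i, j) region pairs
    label_probs: dict   # analytic part: region id -> categorical over LABELS
    weight: float = 1.0

def label_update(prior, observed, accuracy=0.8):
    """Exact Bayes update of one region's categorical label distribution.

    Returns (posterior, evidence), where evidence is the marginal
    likelihood of the observed label under the prior.
    """
    miss = (1.0 - accuracy) / (len(LABELS) - 1)
    joint = [p * (accuracy if lab == observed else miss)
             for p, lab in zip(prior, LABELS)]
    evidence = sum(joint)
    return [j / evidence for j in joint], evidence

def propose_topology(edges, n_regions, rng, p_add=0.3):
    """Sampling step (assumed proposal): occasionally add one random edge."""
    if n_regions >= 2 and rng.random() < p_add:
        i, j = sorted(rng.sample(range(n_regions), 2))
        return edges | {(i, j)}
    return edges

def step(particles, region, observed, n_regions, rng):
    """One filter step for a spoken label attached to a region."""
    for p in particles:
        p.edges = propose_topology(p.edges, n_regions, rng)
        posterior, evidence = label_update(p.label_probs[region], observed)
        p.label_probs[region] = posterior
        p.weight *= evidence  # reweight by the analytically marginalized likelihood
    total = sum(p.weight for p in particles)
    for p in particles:
        p.weight /= total     # normalize so the weights sum to one
    return particles
```

In this toy version the weight depends only on how well a particle's existing label belief explains the utterance; a fuller treatment would also condition on the sampled topology and on metric poses, and would resample when the weights degenerate.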
Pages: 1167-1190
Page count: 24