Affordance-based robot object retrieval

被引:4
|
作者
Thao Nguyen [1 ]
Gopalan, Nakul [1 ,4 ]
Patel, Roma [1 ]
Corsaro, Matt [2 ]
Pavlick, Ellie [3 ]
Tellex, Stefanie [3 ]
机构
[1] Brown Univ, Providence, RI 02912 USA
[2] Brown Univ, Comp Sci, George Konidariss Intelligent Robot Lab, Providence, RI 02912 USA
[3] Brown Univ, Comp Sci, Providence, RI 02912 USA
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
Robots;
D O I
10.1007/s10514-021-10008-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language object retrieval is a highly useful yet challenging task for robots in human-centric environments. Previous work has primarily focused on commands specifying the desired object's type such as "scissors" and/or visual attributes such as "red," thus limiting the robot to only known object classes. We develop a model to retrieve objects based on descriptions of their usage. The model takes in a language command containing a verb, for example "Hand me something to cut," and RGB images of candidate objects; and outputs the object that best satisfies the task specified by the verb. Our model directly predicts an object's appearance from the object's use specified by a verb phrase, without needing an object's class label. Based on contextual information present in the language commands, our model can generalize to unseen object classes and unknown nouns in the commands. Our model correctly selects objects out of sets of five candidates to fulfill natural language commands, and achieves a mean reciprocal rank of 77.4% on a held-out test set of unseen ImageNet object classes and 69.1% on unseen object classes and unknown nouns. Our model also achieves a mean reciprocal rank of 71.8% on unseen YCB object classes, which have a different image distribution from ImageNet. We demonstrate our model on a KUKA LBR iiwa robot arm, enabling the robot to retrieve objects based on natural language descriptions of their usage (Video recordings of the robot demonstrations can be found at ). We also present a new dataset of 655 verb-object pairs denoting object usage over 50 verbs and 216 object classes (The dataset and code for the project can be found at https://github.com/Thaonguyen3095/affordance- language).
引用
收藏
页码:83 / 98
页数:16
相关论文
共 50 条
  • [21] Affordance-Based Object Recognition Using Interactions Obtained from a Utility Maximization Principle
    Kluth, Tobias
    Nakath, David
    Reineking, Thomas
    Zetzsche, Christoph
    Schill, Kerstin
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT II, 2015, 8926 : 406 - 412
  • [22] GAM: General affordance-based manipulation for contact-rich object disentangling tasks
    Yang, Xintong
    Wu, Jing
    Lai, Yu-Kun
    Ji, Ze
    NEUROCOMPUTING, 2024, 578
  • [23] Affordance-based problem structuring for workplace innovation
    Durugbo, Christopher M.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2020, 284 (02) : 617 - 631
  • [24] Effects of Cognitive Load on Affordance-based Interactions
    Grgic, Joseph E.
    Still, Mary L.
    Still, Jeremiah D.
    APPLIED COGNITIVE PSYCHOLOGY, 2016, 30 (06) : 1042 - 1051
  • [25] An Affordance-Based Approach to Visually Guided Overtaking
    Morice, Antoine H. P.
    Diaz, Gabriel J.
    Fajen, Brett R.
    Basilio, Numa
    Montagne, Gilles
    ECOLOGICAL PSYCHOLOGY, 2015, 27 (01) : 1 - 25
  • [26] Nested affordance-based intuitive design tool: Affordance interaction matrix
    Gao, Yixuan
    Song, Duanshu
    Liu, Li
    Huang, Yuexin
    COGENT ENGINEERING, 2023, 10 (01):
  • [27] The Significance of Ecological Methodology of Affordance-Based Design
    Luo, Lingling
    Wang, Xiaohang
    RENEWABLE ENERGY AND ENVIRONMENTAL TECHNOLOGY, PTS 1-6, 2014, 448-453 : 890 - 896
  • [28] Redesigning a Website Using Affordance-Based Design
    Rokhmawati, Retno Indah
    Az-Zahra, Hanifah Muslimah
    Fadhilah, Nissa Madaniyah
    PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 295 - 300
  • [29] Affordance-based Ontology Design for Ubiquitous Robots
    Hidayat, Sidiq S.
    Kim, B. K.
    Ohba, Kohtaro
    2008 17TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1 AND 2, 2008, : 622 - +
  • [30] The Aspect Transition Graph: An Affordance-Based Model
    Ku, Li Yang
    Sen, Shiraj
    Learned-Miller, Erik G.
    Grupen, Roderic A.
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT II, 2015, 8926 : 459 - 465