Representing, learning, and controlling complex object interactions

被引:1
|
作者
Zhou, Yilun [1 ]
Burchfiel, Benjamin [2 ]
Konidaris, George [3 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Duke Univ, Duke Robot, Durham, NC USA
[3] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
基金
美国国家卫生研究院;
关键词
Robotics; Task representation; Task learning; Markov decision process;
D O I
10.1007/s10514-018-9740-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for representing scenarios with complex object interactions, where a robot cannot directly interact with the object it wishes to control and must instead influence it via intermediate objects. For instance, a robot learning to drive a car can only change the car's pose indirectly via the steering wheel, and must represent and reason about the relationship between its own grippers and the steering wheel, and the relationship between the steering wheel and the car. We formalize these interactions as chains and graphs of Markov decision processes (MDPs) and show how such models can be learned from data. We also consider how they can be controlled given known or learned dynamics. We show that our complex model can be collapsed into a single MDP and solved to find an optimal policy for the combined system. Since the resulting MDP may be very large, we also introduce a planning algorithm that efficiently produces a potentially suboptimal policy. We apply these models to two systems in which a robot uses learning from demonstration to achieve indirect control: playing a computer game using a joystick, and using a hot water dispenser to heat a cup of water.
引用
收藏
页码:1355 / 1367
页数:13
相关论文
共 50 条
  • [41] Representing the object of controversy: The case of the molecular clock
    Dietrich, Michael R.
    HISTORY AND PHILOSOPHY OF THE LIFE SCIENCES, 2007, 29 (02): : 161 - 176
  • [42] Design and use of a learning object for finding complex polynomial roots
    Benitez, Julio
    Gimenez, Marcos H.
    Hueso, Jose L.
    Martinez, Eulalia
    Riera, Jaime
    INTERNATIONAL JOURNAL OF MATHEMATICAL EDUCATION IN SCIENCE AND TECHNOLOGY, 2013, 44 (03) : 365 - 376
  • [43] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423
  • [44] Unraveling hidden interactions in complex systems with deep learning
    Seungwoong Ha
    Hawoong Jeong
    Scientific Reports, 11
  • [45] Visual dynamics cues in learning complex physical interactions
    Christopher J. Hasson
    Paneed F. Jalili
    Scientific Reports, 9
  • [46] Unraveling hidden interactions in complex systems with deep learning
    Ha, Seungwoong
    Jeong, Hawoong
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [47] Complex Interactions in Student Teaching Lost Opportunities for Learning
    Valencia, Sheila W.
    Martin, Susan D.
    Place, Nancy A.
    Grossman, Pam
    JOURNAL OF TEACHER EDUCATION, 2009, 60 (03) : 304 - 322
  • [48] Visual dynamics cues in learning complex physical interactions
    Hasson, Christopher J.
    Jalili, Paneed F.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [49] A Reinforcement Learning Based Approach for Controlling Autonomous Vehicles in Complex Scenarios
    Ben Elallid, Badr
    Bagaa, Miloud
    Benamar, Nabil
    Mrani, Nabil
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 1358 - 1364
  • [50] The hebbosome is a large multiprotein complex detecting and controlling synaptic activity and learning
    Grant, SGN
    JOURNAL OF NEUROCHEMISTRY, 2001, 78 : 204 - 204