Learning Constraints on Autonomous Behavior from Proactive Feedback

被引:0
|
作者
Basich, Connor [1 ]
Mahmud, Saaduddin [1 ]
Zilberstein, Shlomo [1 ]
机构
[1] Univ Massachusetts Amherst, Manning Coll Informat & Comp Sci, Amherst, MA 01003 USA
关键词
D O I
10.1109/IROS55552.2023.10341801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from feedback is a common paradigm to acquire information that is hard to specify a priori. In this work, we consider an agent with a known nominal reward model that captures its high-level task objective. Furthermore, the agent operates subject to constraints that are unknown a priori and must be inferred from human interventions. Unlike existing methods, our approach does not rely on full or partial demonstration trajectories or assume a fully reactive human. Instead, we assume access only to sparse interventions, which may in fact be generated proactively by the human, and we only make minimal assumptions about the human. We provide both theoretical bounds on performance and empirical validations of our method. We show that our method enables an agent to learn a constraint set with high accuracy that generalizes well to new environments within a domain, whereas methods that only consider reactive feedback learn an incorrect constraint set that does not generalize well, making constraint violations more likely in new environments.
引用
收藏
页码:3680 / 3687
页数:8
相关论文
共 50 条
  • [1] Learning Behavior Trees for Autonomous Agents with Hybrid Constraints Evolution
    Zhang, Qi
    Yao, Jian
    Yin, Quanjun
    Zha, Yabing
    APPLIED SCIENCES-BASEL, 2018, 8 (07):
  • [2] Proactive autonomous resource enrichment for e-learning
    Mencke, Steffen
    Rud, Dmytro
    Zbrog, Fritz
    Durnke, Reiner
    WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2008, : 464 - 467
  • [3] Supervisor developmental feedback and employees' proactive innovation behavior
    Fang, Yangchun
    Liu, Yonghua
    Chen, Nuo
    SOCIAL BEHAVIOR AND PERSONALITY, 2024, 52 (10):
  • [4] When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
    Xie, Annie
    Tajwar, Fahim
    Sharma, Archit
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Learning proactive behavior for interactive social robots
    Phoebe Liu
    Dylan F. Glas
    Takayuki Kanda
    Hiroshi Ishiguro
    Autonomous Robots, 2018, 42 : 1067 - 1085
  • [6] Learning proactive behavior for interactive social robots
    Liu, Phoebe
    Glas, Dylan F.
    Kanda, Takayuki
    Ishiguro, Hiroshi
    AUTONOMOUS ROBOTS, 2018, 42 (05) : 1067 - 1085
  • [7] Market Feedback, Investment Constraints, and Managerial Behavior
    Hill, Paula
    Hillier, David
    EUROPEAN FINANCIAL MANAGEMENT, 2009, 15 (03) : 584 - 605
  • [8] Assimilating human feedback from autonomous vehicle interaction in reinforcement learning models
    Fox, Richard
    Ludvig, Elliot A.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (02)
  • [9] A Concept for Proactive Knowledge Construction in Self-Learning Autonomous Systems
    Stein, Anthony
    Tomforde, Sven
    Diaconescu, Ada
    Haehner, Joerg
    Mueller-Schloer, Christian
    2018 IEEE 3RD INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2018, : 204 - 213
  • [10] Autonomous Learning based Proactive Deployment for UAV Assisted Wireless Networks
    Wang, Yatong
    Yan, Mu
    Feng, Gang
    Qin, Shuang
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1320 - 1325