Learning Constraints on Autonomous Behavior from Proactive Feedback

被引:0
|
作者
Basich, Connor [1 ]
Mahmud, Saaduddin [1 ]
Zilberstein, Shlomo [1 ]
机构
[1] Univ Massachusetts Amherst, Manning Coll Informat & Comp Sci, Amherst, MA 01003 USA
关键词
D O I
10.1109/IROS55552.2023.10341801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from feedback is a common paradigm to acquire information that is hard to specify a priori. In this work, we consider an agent with a known nominal reward model that captures its high-level task objective. Furthermore, the agent operates subject to constraints that are unknown a priori and must be inferred from human interventions. Unlike existing methods, our approach does not rely on full or partial demonstration trajectories or assume a fully reactive human. Instead, we assume access only to sparse interventions, which may in fact be generated proactively by the human, and we only make minimal assumptions about the human. We provide both theoretical bounds on performance and empirical validations of our method. We show that our method enables an agent to learn a constraint set with high accuracy that generalizes well to new environments within a domain, whereas methods that only consider reactive feedback learn an incorrect constraint set that does not generalize well, making constraint violations more likely in new environments.
引用
收藏
页码:3680 / 3687
页数:8
相关论文
共 50 条
  • [21] Proactive behavior as a reaction to job stressors: Stressor-specific proactive behavior and general proactive behavior
    Spychala, Anne
    Sonnentag, Sabine
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 591 - 591
  • [22] Proactive Feedback for Networked CPS
    Ghosh, Sumana
    Mondal, Arnab
    Roy, Debayan
    Kindt, Philipp H.
    Dey, Soumyajit
    Chakraborty, Samarjit
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 164 - 173
  • [23] Effects of Correctness and Suggestive Feedback on Learning with an Autonomous Virtual Trainer
    Shang, Xiumin
    Kallmann, Marcelo
    Arif, Ahmed Sabbir
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES: COMPANION (IUI 2019), 2019, : 93 - 94
  • [24] Autonomous vehicle steering based on evaluative feedback by reinforcement learning
    Kuhnert, KD
    Krödel, M
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINDS, 2005, 3587 : 405 - 414
  • [25] Basal Ganglia Models for Autonomous Behavior Learning
    Tsujino, Hiroshi
    Takeuchi, Johane
    Shouno, Osamu
    CREATING BRAIN-LIKE INTELLIGENCE: FROM BASIC PRINCIPLES TO COMPLEX INTELLIGENT SYSTEMS, 2009, 5436 : 328 - 350
  • [26] Autonomous Learning of Page Flipping Movements via Tactile Feedback
    Zheng, Yi
    Veiga, Filipe Fernandes
    Peters, Jan
    Santos, Veronica J.
    IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (05) : 2734 - 2749
  • [27] Autonomous Learning Approach to Characterizing Motion Behavior
    Anil, Rashmi
    Khanna, Hemen
    Keshavamurthy, Anil S.
    Khanna, Rahul
    Haswarey, Asif
    PROCEEDINGS OF THE 2017 IEEE TOPICAL CONFERENCE ON WIRELESS SENSORS AND SENSOR NETWORKS (WISNET), 2017, : 49 - 52
  • [28] Proactive Collision Avoidance for Autonomous Ships: Leveraging Machine Learning to Emulate Situation Awareness
    Murray, Brian
    Perera, Lokukaluge Prasad
    IFAC PAPERSONLINE, 2021, 54 (16): : 16 - 23
  • [29] Deep learning based image processing for proactive data collecting system for autonomous vehicle
    Trong-Hop Do
    Ngan-Linh Nguyen
    Hoang-Thong Vo
    Thanh-Binh Nguyen
    Ngo Tan Vu Khanh
    2021 21ST ACIS INTERNATIONAL WINTER CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD-WINTER 2021), 2021, : 253 - 256
  • [30] Proactive acquisition from tutoring and learning principles
    Kim, J
    Gil, Y
    ARTIFICIAL INTELLIGENCE IN EDUCATION: SHAPING THE FUTURE OF LEARNING THROUGH INTELLIGENT TECHNOLOGIES, 2003, 97 : 175 - 182