Learning Constraints on Autonomous Behavior from Proactive Feedback

Authors:

Basich, Connor [1]

Mahmud, Saaduddin [1]

Zilberstein, Shlomo [1]

Affiliations:

[1] Univ Massachusetts Amherst, Manning Coll Informat & Comp Sci, Amherst, MA 01003 USA

Source:

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS | 2023

DOI:

10.1109/IROS55552.2023.10341801

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning from feedback is a common paradigm to acquire information that is hard to specify a priori. In this work, we consider an agent with a known nominal reward model that captures its high-level task objective. Furthermore, the agent operates subject to constraints that are unknown a priori and must be inferred from human interventions. Unlike existing methods, our approach does not rely on full or partial demonstration trajectories or assume a fully reactive human. Instead, we assume access only to sparse interventions, which may in fact be generated proactively by the human, and we only make minimal assumptions about the human. We provide both theoretical bounds on performance and empirical validations of our method. We show that our method enables an agent to learn a constraint set with high accuracy that generalizes well to new environments within a domain, whereas methods that only consider reactive feedback learn an incorrect constraint set that does not generalize well, making constraint violations more likely in new environments.

Pages: 3680-3687

Page count: 8
