Learning Constraints on Autonomous Behavior from Proactive Feedback

Cited by: 0

Authors:

Basich, Connor [1]

Mahmud, Saaduddin [1]

Zilberstein, Shlomo [1]

Affiliations:

[1] Univ Massachusetts Amherst, Manning Coll Informat & Comp Sci, Amherst, MA 01003 USA

Source:

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS | 2023

DOI:

10.1109/IROS55552.2023.10341801

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Learning from feedback is a common paradigm to acquire information that is hard to specify a priori. In this work, we consider an agent with a known nominal reward model that captures its high-level task objective. Furthermore, the agent operates subject to constraints that are unknown a priori and must be inferred from human interventions. Unlike existing methods, our approach does not rely on full or partial demonstration trajectories or assume a fully reactive human. Instead, we assume access only to sparse interventions, which may in fact be generated proactively by the human, and we only make minimal assumptions about the human. We provide both theoretical bounds on performance and empirical validations of our method. We show that our method enables an agent to learn a constraint set with high accuracy that generalizes well to new environments within a domain, whereas methods that only consider reactive feedback learn an incorrect constraint set that does not generalize well, making constraint violations more likely in new environments.

Pages: 3680-3687

Page count: 8

[1] Learning Behavior Trees for Autonomous Agents with Hybrid Constraints Evolution
Zhang, Qi
Yao, Jian
Yin, Quanjun
Zha, Yabing
APPLIED SCIENCES-BASEL, 2018, 8 (07):
[2] Proactive autonomous resource enrichment for e-learning
Mencke, Steffen
Rud, Dmytro
Zbrog, Fritz
Durnke, Reiner
WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2008, : 464 - 467
[3] Supervisor developmental feedback and employees' proactive innovation behavior
Fang, Yangchun
Liu, Yonghua
Chen, Nuo
SOCIAL BEHAVIOR AND PERSONALITY, 2024, 52 (10):
[4] When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
Xie, Annie
Tajwar, Fahim
Sharma, Archit
Finn, Chelsea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[5] Learning proactive behavior for interactive social robots
Phoebe Liu
Dylan F. Glas
Takayuki Kanda
Hiroshi Ishiguro
Autonomous Robots, 2018, 42 : 1067 - 1085
[6] Learning proactive behavior for interactive social robots
Liu, Phoebe
Glas, Dylan F.
Kanda, Takayuki
Ishiguro, Hiroshi
AUTONOMOUS ROBOTS, 2018, 42 (05) : 1067 - 1085
[7] Market Feedback, Investment Constraints, and Managerial Behavior
Hill, Paula
Hillier, David
EUROPEAN FINANCIAL MANAGEMENT, 2009, 15 (03) : 584 - 605
[8] Assimilating human feedback from autonomous vehicle interaction in reinforcement learning models
Fox, Richard
Ludvig, Elliot A.
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (02)
[9] A Concept for Proactive Knowledge Construction in Self-Learning Autonomous Systems
Stein, Anthony
Tomforde, Sven
Diaconescu, Ada
Haehner, Joerg
Mueller-Schloer, Christian
2018 IEEE 3RD INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2018, : 204 - 213
[10] Autonomous Learning based Proactive Deployment for UAV Assisted Wireless Networks
Wang, Yatong
Yan, Mu
Feng, Gang
Qin, Shuang
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1320 - 1325
