Online Learning of Feasible Strategies in Unknown Environments

被引:0
|
作者
Paternain, Santiago [1 ]
Ribeiro, Alejandro [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, 200 S 33rd St, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An environment is defined as a set of constraint functions that vary arbitrarily over time. An agent wants to select feasible actions that keep all the constraints negative, but must do so causally. I.e., the dynamical system that determines actions is such that only their time derivatives can depend on the current constraints. An environment is said viable if there exists an action that can satisfy the constraints for all times. The fit of a trajectory is defined as a vector that integrates the constraint violations over time and is used to measure the extent to which a policy succeeds in learning feasible actions. An online saddle point controller is proposed to control fit and shown to do so under minimal technical conditions. The online saddle point controller pushes actions along a linear combination of the constraint negative gradients and dynamically adapts the coefficients of this linear combination to find appropriate weightings. Concepts are illustrated throughout with the problem of a shepherd that wants to stay close to all sheep in a herd. Numerical experiments show that the controller allows the shepherd to do so.
引用
收藏
页码:4231 / 4238
页数:8
相关论文
共 50 条
  • [1] Online Learning of Feasible Strategies in Unknown Environments
    Paternain, Santiago
    Ribeiro, Alejandro
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2807 - 2822
  • [2] Online Learning of Optimal Strategies in Unknown Environments
    Paternain, Santiago
    Ribeiro, Alejandro
    [J]. 2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 3951 - 3958
  • [3] Planning Feasible and Safe Paths Online for Autonomous Underwater Vehicles in Unknown Environments
    David Hernandez, Juan
    Moll, Mark
    Vidal, Eduard
    Carreras, Marc
    Kavraki, Lydia E.
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 1313 - 1320
  • [4] Strategies to motivate students in online learning environments
    Beffa-Negrini, PA
    Cohen, NL
    Miller, B
    [J]. JOURNAL OF NUTRITION EDUCATION AND BEHAVIOR, 2002, 34 (06) : 334 - 340
  • [5] An Iterative Online Approach to Safe Learning in Unknown Constrained Environments
    Minh Vu
    Zeng, Shen
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7330 - 7335
  • [6] Online Learning for Characterizing Unknown Environments in Ground Robotic Vehicle Models
    Koppel, Alec
    Fink, Jonathan
    Warnell, Garrett
    Stump, Ethan
    Ribeiro, Alejandro
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 626 - 633
  • [7] Students' cognitive engagement in online learning environments: Learning strategies and motivations
    Richardson, JC
    Newby, T
    [J]. ED-MEDIA 2004: WORLD CONFERENCE ON EDUCATIONAL MULTIMEDIA, HYPERMEDIA & TELECOMMUNICATIONS, VOLS. 1-7, 2004, : 4028 - 4033
  • [8] Self-regulated learning in online learning environments: strategies for remote learning
    Carter Jr, Richard Allen
    Rice, Mary
    Yang, Sohyun
    Jackson, Haidee A.
    [J]. INFORMATION AND LEARNING SCIENCES, 2020, 121 (5-6) : 321 - 329
  • [9] Analytics of Time Management and Learning Strategies for Effective Online Learning in Blended Environments
    Uzir, Nora'ayu Ahmad
    Gasevic, Dragan
    Jovanovic, Jelena
    Matcha, Wannisa
    Lim, Lisa-Angelique
    Fudge, Anthea
    [J]. LAK20: THE TENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, 2020, : 392 - 401
  • [10] Exploring unknown environments with randomized strategies
    Espinoza, Judith
    Sanchez, Abraham
    Osorio, Maria
    [J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1150 - +