A hybrid learning agent for episodic learning tasks with unknown target distance

被引:0
|
作者
Oliver Sefrin [1 ]
Sabine Wölk [1 ]
机构
[1] German Aerospace Center (DLR),Institute of Quantum Technologies
[2] Ulm University,Institute for Complex Quantum Systems
关键词
Quantum reinforcement learning; Amplitude amplification; Hybrid algorithm; Navigation problem;
D O I
10.1007/s42484-025-00269-1
中图分类号
学科分类号
摘要
The “hybrid agent for quantum-accessible reinforcement learning,” as defined in (Hamann and Wölk New J Phys 24:033044 2022), provides a proven quasi-quadratic speedup and is experimentally tested. However, the standard version can only be applied to episodic learning tasks with fixed episode length. In many real-world applications, the information about the necessary number of steps within an episode to reach a defined target is not available in advance and especially before reaching the target for the first time. Furthermore, in such scenarios, classical agents have the advantage of observing at which step they reach the target. How to best deal with an unknown target distance in classical and quantum reinforcement learning and whether the hybrid agent can provide an advantage in such learning scenarios is unknown so far. In this work, we introduce a hybrid agent with a stochastic episode length selection strategy to alleviate the need for knowledge about the necessary episode length. Through simulations, we test the adapted hybrid agent’s performance versus classical counterparts with and without similar episode selection strategies. Our simulations demonstrate a speedup in certain scenarios due to our developed episode length selection strategy for classical learning agents as well as an additional speedup for our resulting hybrid learning agent.
引用
收藏
相关论文
共 50 条
  • [11] Designing e-learning courses for classroom and distance learning in physics: The role of learning tasks
    Laumann, Daniel
    Fischer, Julian Alexander
    Stuermer-Steinmann, Tatjana K.
    Welberg, Julia
    Wessnigk, Susanne
    Neumann, Knut
    PHYSICAL REVIEW PHYSICS EDUCATION RESEARCH, 2024, 20 (01):
  • [12] A Methodology for the Development of Distance Learning Tasks Adaptable to the Student's Learning Style
    Ferrer, Eduardo
    Kirschning, Ingrid
    4TH WORLD CONFERENCE ON LEARNING TEACHING AND EDUCATIONAL LEADERSHIP (WCLTA-2013), 2014, 141 : 518 - 523
  • [13] Examining a hybrid distance learning model
    French, W
    Jaju, A
    Brown, S
    2000 AMA WINTER EDUCATORS' CONFERENCE - MARKETING THEORY AND APPLICATIONS, 2000, 11 : 13 - 21
  • [14] The Design of Tasks to Suit Distance Learning in Emergency Education
    Daher, Wajeeh
    Abo Mokh, Amnah
    Shayeb, Shaheen
    Jaber, Reema
    Saqer, Khitam
    Dawood, Iman
    Bsharat, Maysa
    Rabbaa, Mohammad
    SUSTAINABILITY, 2022, 14 (03)
  • [15] Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks
    Vamplew, Peter
    Dazeley, Richard
    Barker, Ewan
    Kelarev, Andrei
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 340 - 349
  • [16] How Emotional Mechanism Helps Episodic Learning in a Cognitive Agent
    Faghihi, Usef
    Fournier-Viger, Philippe
    Nkambou, Roger
    Poirier, Pierre
    Mayers, Andre
    IA 2009: IEEE SYMPOSIUM ON INTELLIGENT AGENTS, 2009, : 23 - +
  • [17] Heterogeneous Skill Learning for Multi-agent Tasks
    Liu, Yuntao
    Li, Yuan
    Xu, Xinhai
    Dou, Yong
    Liu, Donghong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [18] A Reference Architecture of a Hybrid Learning Agent
    Leite, Adriana
    Girardi, Rosario
    2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 421 - 424
  • [19] Hybrid Trajectory and Force Learning of Complex Assembly Tasks: A Combined Learning Framework
    Wang, Yan
    Beltran-Hernandez, Cristian C.
    Wan, Weiwei
    Harada, Kensuke
    IEEE Access, 2021, 9 : 60175 - 60186
  • [20] Hybrid Trajectory and Force Learning of Complex Assembly Tasks: A Combined Learning Framework
    Wang, Yan
    Beltran-Hernandez, Cristian C.
    Wan, Weiwei
    Harada, Kensuke
    IEEE ACCESS, 2021, 9 : 60175 - 60186