A hybrid learning agent for episodic learning tasks with unknown target distance

被引:0
|
作者
Oliver Sefrin [1 ]
Sabine Wölk [1 ]
机构
[1] German Aerospace Center (DLR),Institute of Quantum Technologies
[2] Ulm University,Institute for Complex Quantum Systems
关键词
Quantum reinforcement learning; Amplitude amplification; Hybrid algorithm; Navigation problem;
D O I
10.1007/s42484-025-00269-1
中图分类号
学科分类号
摘要
The “hybrid agent for quantum-accessible reinforcement learning,” as defined in (Hamann and Wölk New J Phys 24:033044 2022), provides a proven quasi-quadratic speedup and is experimentally tested. However, the standard version can only be applied to episodic learning tasks with fixed episode length. In many real-world applications, the information about the necessary number of steps within an episode to reach a defined target is not available in advance and especially before reaching the target for the first time. Furthermore, in such scenarios, classical agents have the advantage of observing at which step they reach the target. How to best deal with an unknown target distance in classical and quantum reinforcement learning and whether the hybrid agent can provide an advantage in such learning scenarios is unknown so far. In this work, we introduce a hybrid agent with a stochastic episode length selection strategy to alleviate the need for knowledge about the necessary episode length. Through simulations, we test the adapted hybrid agent’s performance versus classical counterparts with and without similar episode selection strategies. Our simulations demonstrate a speedup in certain scenarios due to our developed episode length selection strategy for classical learning agents as well as an additional speedup for our resulting hybrid learning agent.
引用
收藏
相关论文
共 50 条
  • [31] Learning Reward Machines in Cooperative Multi-agent Tasks
    Ardon, Leo
    Furelos-Blanco, Daniel
    Russo, Alessandra
    AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS. BEST AND VISIONARY PAPERS, AAMAS 2023 WORKSHOPS, 2024, 14456 : 43 - 59
  • [32] ALMA: Hierarchical Learning for Composite Multi-Agent Tasks
    Iqbal, Shariq
    Costales, Robby
    Sha, Fei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Teaching Mathematics in Hybrid Learning and Distance Learning: Student's ' s View
    Puman, Ella
    Kritsevskaja, Aljona
    INTERNATIONAL JOURNAL FOR TECHNOLOGY IN MATHEMATICS EDUCATION, 2024, 31 (02):
  • [34] Recursive Bayesian inference and learning for target tracking with unknown maneuvers
    Ji, Ruiping
    Liang, Yan
    Xu, Linfeng
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (04) : 1032 - 1044
  • [35] LEARNING TASKS
    MCKENNA, J
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD, 1976, 8 (01) : 47 - 47
  • [36] Effective, Efficient, and Scalable Unsupervised Distance Learning in Image Retrieval Tasks
    Valem, Lucas Pascotti
    Guimaraes Pedronette, Daniel Carlos
    Torres, Ricardo da S.
    Borin, Edson
    Almeida, Jurandy
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 51 - 58
  • [37] Estimating the conceptual distance between unknown words using machine learning
    Sakai, Yuya
    Matsumoto, Mitsuharu
    2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [38] Kinship verification using multiview hybrid distance learning
    Mahpod, Shahar
    Keller, Yosi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 167 : 28 - 36
  • [39] Assessing Conventional, Hybrid and Distance Learning Courses in Horticulture
    Sciarappa, W. J.
    Quinn, V.
    Ward, B.
    Ward, D.
    HORTSCIENCE, 2019, 54 (09) : S428 - S428
  • [40] TORM: A hybrid multicast infrastructure for interactive distance learning
    Che, Y
    Shi, RT
    Shi, YC
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1595 - 1598