A hybrid learning agent for episodic learning tasks with unknown target distance

Authors
Oliver Sefrin [1]
Sabine Wölk [1]
Affiliations
[1] German Aerospace Center (DLR), Institute of Quantum Technologies
[2] Ulm University, Institute for Complex Quantum Systems
Keywords
Quantum reinforcement learning; Amplitude amplification; Hybrid algorithm; Navigation problem
DOI
10.1007/s42484-025-00269-1
Abstract
The “hybrid agent for quantum-accessible reinforcement learning,” as defined in (Hamann and Wölk, New J Phys 24:033044, 2022), provides a proven quasi-quadratic speedup and has been tested experimentally. However, the standard version can only be applied to episodic learning tasks with a fixed episode length. In many real-world applications, the number of steps needed within an episode to reach a defined target is not known in advance, in particular before the target has been reached for the first time. Furthermore, in such scenarios, classical agents have the advantage of observing at which step they reach the target. How best to deal with an unknown target distance in classical and quantum reinforcement learning, and whether the hybrid agent can provide an advantage in such learning scenarios, has so far been unknown. In this work, we introduce a hybrid agent with a stochastic episode length selection strategy to alleviate the need for knowledge about the necessary episode length. Through simulations, we test the adapted hybrid agent’s performance against classical counterparts with and without similar episode length selection strategies. Our simulations demonstrate a speedup in certain scenarios for classical learning agents due to our episode length selection strategy, as well as an additional speedup for the resulting hybrid learning agent.
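As a rough illustration of where a quadratic-type speedup from amplitude amplification comes from, the sketch below compares the expected number of classical trials with the number of amplification rounds for a given success probability `p`. It uses only the standard textbook relations for amplitude amplification and is not the hybrid agent or the episode length selection strategy from the paper; the function names and the example values of `p` are assumptions chosen for illustration.

```python
import math


def classical_expected_trials(p: float) -> float:
    """Expected number of independent trials until the first success."""
    return 1.0 / p


def amplification_rounds(p: float) -> int:
    """Textbook number of amplitude amplification rounds for known p.

    With sin(theta)^2 = p, each round rotates the state by 2 * theta,
    so about pi / (4 * theta) rounds bring it close to the target subspace.
    """
    theta = math.asin(math.sqrt(p))
    return max(0, math.floor(math.pi / (4 * theta)))


def success_after_rounds(p: float, k: int) -> float:
    """Success probability after k rounds: sin^2((2k + 1) * theta)."""
    theta = math.asin(math.sqrt(p))
    return math.sin((2 * k + 1) * theta) ** 2


if __name__ == "__main__":
    # Illustrative (assumed) success probabilities, not values from the paper.
    for p in (0.25, 1e-2, 1e-4):
        k = amplification_rounds(p)
        print(p, classical_expected_trials(p), k, success_after_rounds(p, k))
```

The round count in this sketch depends on knowing `p` in advance. When the success probability is unknown, the number of rounds cannot be fixed this way.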