Quantum-enhanced reinforcement learning for control: a preliminary study

被引:2
|
作者
Hu, Yazhou [1 ]
Tang, Fengzhen [2 ,3 ]
Chen, Jun [1 ]
Wang, Wenxue [2 ,3 ]
机构
[1] Northwest A&F Univ, Coll Mech & Elect Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China
关键词
Quantum theory; Reinforcement learning; Quantum computation; State superposition; Optimal control; SPEED-UP; COMPUTATION; IMPLEMENTATION; ALGORITHM;
D O I
10.1007/s11768-021-00063-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning is one of the fastest growing areas in machine learning, and has obtained great achievements in biomedicine, Internet of Things (IoT), logistics, robotic control, etc. However, there are still many challenges for engineering applications, such as how to speed up the learning process, how to balance the trade-off between exploration and exploitation. Quantum technology, which can solve complex problems faster than classical methods, especially in supercomputers, provides us a new paradigm to overcome these challenges in reinforcement learning. In this paper, a quantum-enhanced reinforcement learning is pictured for optimal control. In this algorithm, the states and actions of reinforcement learning are quantized by quantum technology. And then, a probability amplification method, which can effectively avoid the trade-off between exploration and exploitation via quantized technology, is presented. Finally, the optimal control policy is learnt during the process of reinforcement learning. The performance of this quantized algorithm is demonstrated in both MountainCar reinforcement learning environment and CartPole reinforcement learning environment-one kind of classical control reinforcement learning environment in the OpenAI Gym. The preliminary study results validate that, compared with Q-learning, this quantized reinforcement learning method has better control performance without considering the trade-off between exploration and exploitation. The learning performance of this new algorithm is stable with different learning rates from 0.01 to 0.10, which means it is promising to be employed in unknown dynamics systems.
引用
收藏
页码:455 / 464
页数:10
相关论文
共 50 条
  • [1] Quantum-enhanced reinforcement learning for control: a preliminary study
    Yazhou Hu
    Fengzhen Tang
    Jun Chen
    Wenxue Wang
    [J]. Control Theory and Technology, 2021, 19 : 455 - 464
  • [2] Investigating the Effects of Hyperparameters in Quantum-Enhanced Deep Reinforcement Learning
    Fikadu Tilaye, Getahun
    Pandey, Amit
    [J]. Quantum Engineering, 2023, 2023
  • [3] Quantum-Enhanced Machine Learning
    Dunjko, Vedran
    Taylor, Jacob M.
    Briegel, Hans J.
    [J]. PHYSICAL REVIEW LETTERS, 2016, 117 (13)
  • [4] Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces
    Neukart, Florian
    Von Dollen, David
    Seidel, Christian
    Compostella, Gabriele
    [J]. FRONTIERS IN PHYSICS, 2018, 5 : 71
  • [5] Supervised learning with quantum-enhanced feature spaces
    Vojtěch Havlíček
    Antonio D. Córcoles
    Kristan Temme
    Aram W. Harrow
    Abhinav Kandala
    Jerry M. Chow
    Jay M. Gambetta
    [J]. Nature, 2019, 567 : 209 - 212
  • [6] Supervised learning with quantum-enhanced feature spaces
    Havlicek, Vojtech
    Corcoles, Antonio D.
    Temme, Kristan
    Harrow, Aram W.
    Kandala, Abhinav
    Chow, Jerry M.
    Gambetta, Jay M.
    [J]. NATURE, 2019, 567 (7747) : 209 - 212
  • [7] Experimental Quantum-enhanced Cryptographic Remote Control
    Pang, Xiao-Ling
    Qiao, Lu-Feng
    Sun, Ke
    Liu, Yu
    Yang, Ai-lin
    Jin, Xian-Min
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [8] Experimental Quantum-enhanced Cryptographic Remote Control
    Xiao-Ling Pang
    Lu-Feng Qiao
    Ke Sun
    Yu Liu
    Ai-lin Yang
    Xian-Min Jin
    [J]. Scientific Reports, 9
  • [9] Quantum-Enhanced Control of a Tandem Queue System
    Stamatiou, George T.
    Magoutis, Kostas
    [J]. PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS, VALUETOOLS 2023, 2024, 539 : 99 - 114
  • [10] Quantum-enhanced learning of rotations about an unknown direction
    Mo, Yin
    Chiribella, Giulio
    [J]. NEW JOURNAL OF PHYSICS, 2019, 21 (11):