Variational quantum compiling with double Q-learning

被引：25

作者：

He, Zhimin ^{[1
,2
]}

Li, Lvzhou ^{[3
]}

Zheng, Shenggen ^{[2
]}

Li, Yongyao ^{[4
]}

Situ, Haozhen ^{[5
]}

机构：

[1] Foshan Univ, Sch Elect & Informat Engn, Foshan 528000, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China

[3] Sun Yat Sen Univ, Inst Quantum Comp & Comp Sci Theory, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

[4] Foshan Univ, Sch Phys & Optoelect Engn, Foshan 528000, Peoples R China

[5] South China Agr Univ, Coll Math & Informat, Guangzhou 510642, Peoples R China

来源：

NEW JOURNAL OF PHYSICS | 2021年 / 23卷 / 03期

基金：

中国国家自然科学基金;

关键词：

variational quantum compiling; reinforcement learning; double Q-learning;

D O I：

10.1088/1367-2630/abe0ae

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Quantum compiling aims to construct a quantum circuit V by quantum gates drawn from a native gate alphabet, which is functionally equivalent to the target unitary U. It is a crucial stage for the running of quantum algorithms on noisy intermediate-scale quantum (NISQ) devices. However, the space for structure exploration of quantum circuit is enormous, resulting in the requirement of human expertise, hundreds of experimentations or modifications from existing quantum circuits. In this paper, we propose a variational quantum compiling (VQC) algorithm based on reinforcement learning, in order to automatically design the structure of quantum circuit for VQC with no human intervention. An agent is trained to sequentially select quantum gates from the native gate alphabet and the qubits they act on by double Q-learning with epsilon-greedy exploration strategy and experience replay. At first, the agent randomly explores a number of quantum circuits with different structures, and then iteratively discovers structures with higher performance on the learning task. Simulation results show that the proposed method can make exact compilations with less quantum gates compared to previous VQC algorithms. It can reduce the errors of quantum algorithms due to decoherence process and gate noise in NISQ devices, and enable quantum algorithms especially for complex algorithms to be executed within coherence time.

引用

页数：14

共 50 条

[31] Contextual Q-Learning
Pinto, Tiago
Vale, Zita
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2927 - 2928
[32] CVaR Q-Learning
Stanko, Silvestr
Macek, Karel
COMPUTATIONAL INTELLIGENCE: 11th International Joint Conference, IJCCI 2019, Vienna, Austria, September 17-19, 2019, Revised Selected Papers, 2021, 922 : 333 - 358
[33] Bayesian Q-learning
Dearden, R
Friedman, N
Russell, S
FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 761 - 768
[34] Zap Q-Learning
Devraj, Adithya M.
Meyn, Sean P.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[35] Convex Q-Learning
Lu, Fan
Mehta, Prashant G.
Meyn, Sean P.
Neu, Gergely
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 4749 - 4756
[36] Fuzzy Q-learning
Glorennec, PY
Jouffe, L
PROCEEDINGS OF THE SIXTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS I - III, 1997, : 659 - 662
[37] Q-learning and robotics
Touzet, CF
Santos, JM
SIMULATION IN INDUSTRY 2001, 2001, : 685 - 689
[38] Periodic Q-Learning
Lee, Donghwan
He, Niao
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 582 - 598
[39] Q-learning automaton
Qian, F
Hirata, H
IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 432 - 437
[40] Mutual Q-learning
Reid, Cameron
Mukhopadhyay, Snehasis
2020 3RD INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTS (ICCR 2020), 2020, : 128 - 133

← 1 2 3 4 5 →