Coherent beam combination based on Q-learning algorithm

被引:17
|
作者
Zhang, Xi [1 ]
Li, Pingxue [1 ]
Zhu, Yunchen [1 ]
Li, Chunyong [2 ]
Yao, Chuanfei [1 ]
Wang, Luo [1 ]
Dong, Xueyan [1 ]
Li, Shun [1 ]
机构
[1] Beijing Univ Technol, Fac Mat & Mfg, Inst Ultrashort Pulsed Laser & Applicat, Beijing 10024, Peoples R China
[2] Univ Durham, Dept Phys, South Rd, Durham DH1 3LE, England
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Coherent beam combination; Q-learning algorithm; Stochastic parallel gradient descent; optimization algorithm; FIBER LASERS;
D O I
10.1016/j.optcom.2021.126930
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Coherent beam combination (CBC) is an effective method to break the limiting power of a single fiber laser. The Q-learning algorithm is one of the reinforcement learning algorithms. We use the Q-learning algorithm to do phase compensation in the field of CBC. The performance difference between the Q-learning algorithm and the stochastic parallel gradient descent optimization algorithm (SPGD) is analyzed by simulating time-domain coherent synthesis. The results show that the Q-learning algorithm is easier to debug and has better stability.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Backward Q-learning: The combination of Sarsa algorithm and Q-learning
    Wang, Yin-Hao
    Li, Tzuu-Hseng S.
    Lin, Chih-Jui
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (09) : 2184 - 2193
  • [2] Combination Optimization Model of Urban Key Intersections Based on Q-Learning Algorithm
    Dong, Dan-Ping
    Wei, Fu-Lu
    Chen, Ming-Tao
    Guo, Yong-Qing
    Yang, Chang-Hai
    Han, Yu-Xin
    [J]. CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 849 - 859
  • [3] An ARM-based Q-learning algorithm
    Hsu, Yuan-Pao
    Hwang, Kao-Shing
    Lin, Hsin-Yi
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 11 - +
  • [4] A Task Scheduling Algorithm Based on Q-Learning for WSNs
    Zhang, Benhong
    Wu, Wensheng
    Bi, Xiang
    Wang, Yiming
    [J]. COMMUNICATIONS AND NETWORKING, CHINACOM 2018, 2019, 262 : 521 - 530
  • [5] Power Control Algorithm Based on Q-Learning in Femtocell
    Li Yun
    Tang Ying
    Liu Hanxiao
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2557 - 2564
  • [6] Q-Learning Algorithm Based on Incremental RBF Network
    Hu Y.
    Li D.
    He Y.
    Han J.
    [J]. Jiqiren/Robot, 2019, 41 (05): : 562 - 573
  • [7] A new Q-learning algorithm based on the Metropolis criterion
    Guo, MZ
    Liu, Y
    Malec, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (05): : 2140 - 2143
  • [8] Adaptive PID controller based on Q-learning algorithm
    Shi, Qian
    Lam, Hak-Keung
    Xiao, Bo
    Tsai, Shun-Hung
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (04) : 235 - 244
  • [9] Hexagon-based Q-learning algorithm and applications
    Yang, Hyun-Chang
    Kim, Ho-Duck
    Yoon, Han-Ul
    Jang, In-Hun
    Sim, Kwee-Bo
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2007, 5 (05) : 570 - 576
  • [10] Ramp Metering Control Based on the Q-Learning Algorithm
    Ivanjko, Edouard
    Necoska, Daniela Koltovska
    Greguric, Martin
    Vujic, Miroslav
    Jurkovic, Goran
    Mandzuka, Sadko
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2015, 15 (05) : 88 - 97