A Q-learning based iterated local search algorithm for human-UAV cooperation in restoring transmission network

Cited by: 1
Authors
Xu, Ying [1 ,2 ]
Li, Xiaobo [3 ]
Meng, Xiangpei [1 ,2 ]
Affiliations
[1] Ningbo Univ Finance & Econ, Coll Digital Technol & Engineer, Ningbo 315175, Zhejiang, Peoples R China
[2] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315211, Zhejiang, Peoples R China
[3] Zhejiang Normal Univ, Sch Comp Sci & Technol, Jinhua 321004, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Power transmission network; Q-learning; Iterated local search; Collaborative scheduling; POWER-SYSTEM RESTORATION; SERVICE RESTORATION; RESCUE UNITS; OPTIMIZATION; DISASTER; INSPECTION;
DOI
10.1016/j.eswa.2024.124200
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Power transmission networks are easily damaged when natural or man-made disasters occur. Restoring power supply in a disaster environment is difficult because a large-scale network typically contains many uninspected faulty nodes. Using unmanned aerial vehicles (UAVs) to inspect these unknown faulty nodes can significantly improve the efficiency of the subsequent restoration work performed by human teams. Nevertheless, efficient cooperation between UAVs and human teams is complicated by the complexity of the network structure and the correlation between UAV scheduling and human-team scheduling. In this paper, a mathematical model is established to describe the problem, with the aim of maximizing the restored power supply within a limited response time. A Q-learning based iterated local search (Q ILS) algorithm is then proposed to solve the collaborative scheduling problem. First, an initialization method is designed to assign UAVs to inspect unknown faulty nodes and human teams to repair faulty nodes, which ensures that each unknown faulty node is inspected before maintenance. Second, search operators, including perturbation and local search procedures, are designed to provide exploration and exploitation capability. Third, Q-learning is used as a learning engine to guide the direction of solution evolution. Moreover, the parameters of Q ILS are calibrated by multi-factor analysis of variance to determine proper values. Computational simulations and comparison experiments validate the superiority of the proposed algorithm.
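The general idea of letting Q-learning steer an iterated local search can be illustrated with a minimal sketch. This is not the authors' Q ILS: the toy objective (power restored by a single repair sequence before a deadline), the two perturbation operators, the two-state encoding, the reward, and all parameter values are assumptions chosen only for illustration, and UAV inspection is not modeled.

```python
import random


def restored_power(order, repair_time, power, time_limit):
    """Toy objective (assumed): total power of nodes repaired before the deadline."""
    elapsed, total = 0.0, 0.0
    for node in order:
        elapsed += repair_time[node]
        if elapsed > time_limit:
            break
        total += power[node]
    return total


def swap(order, rng):
    """Perturbation 1 (assumed): exchange two random positions."""
    i, j = rng.sample(range(len(order)), 2)
    new = order[:]
    new[i], new[j] = new[j], new[i]
    return new


def reverse_segment(order, rng):
    """Perturbation 2 (assumed): reverse a random contiguous segment."""
    i, j = sorted(rng.sample(range(len(order)), 2))
    return order[:i] + order[i:j + 1][::-1] + order[j + 1:]


def local_search(order, score):
    """Adjacent-swap hill climbing; stops when no neighbor improves the score."""
    best, best_val = order, score(order)
    improved = True
    while improved:
        improved = False
        for k in range(len(best) - 1):
            cand = best[:]
            cand[k], cand[k + 1] = cand[k + 1], cand[k]
            val = score(cand)
            if val > best_val:
                best, best_val, improved = cand, val, True
    return best, best_val


def q_ils(repair_time, power, time_limit, iters=200,
          alpha=0.1, gamma=0.9, eps=0.2, seed=0):
    """Iterated local search whose perturbation operator is chosen by Q-learning."""
    rng = random.Random(seed)
    operators = [swap, reverse_segment]

    def score(order):
        return restored_power(order, repair_time, power, time_limit)

    # Assumed state encoding: 0 = previous step did not improve, 1 = it improved.
    q = [[0.0] * len(operators) for _ in range(2)]
    current, current_val = local_search(list(range(len(power))), score)
    best, best_val = current, current_val
    state = 0
    for _ in range(iters):
        # Epsilon-greedy choice of the perturbation operator.
        if rng.random() < eps:
            action = rng.randrange(len(operators))
        else:
            action = max(range(len(operators)), key=lambda a: q[state][a])
        cand, cand_val = local_search(operators[action](current, rng), score)
        reward = cand_val - current_val  # assumed reward: change in objective
        next_state = 1 if reward > 0 else 0
        # Standard tabular Q-learning update.
        q[state][action] += alpha * (
            reward + gamma * max(q[next_state]) - q[state][action]
        )
        if cand_val >= current_val:  # accept equal or better candidates
            current, current_val = cand, cand_val
        if current_val > best_val:
            best, best_val = current, current_val
        state = next_state
    return best, best_val
```

A call such as `q_ils([4, 1, 2, 3], [5, 3, 4, 1], time_limit=5)` returns a repair order and its objective value. Accepting equal-valued candidates lets the search drift across plateaus, which is one common acceptance rule among several.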
Pages: 16
Related papers
50 in total
  • [31] A novel Q-learning based variable neighborhood iterative search algorithm for solving disassembly line scheduling problems
    Ren, Yaxian
    Gao, Kaizhou
    Fu, Yaping
    Sang, Hongyan
    Li, Dachao
    Luo, Zile
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
  • [32] Q-Learning Based MEP Search Algorithm and Coverage Enhancement Strategy in IoT-Enabled Intrusion Detection
    Yao, Yindi
    Tian, Yuying
    Li, Xiong
    Yang, Xuan
    Zhao, Bozhan
    Yang, Ying
    IEEE SENSORS JOURNAL, 2024, 24 (02) : 2180 - 2193
  • [33] A Trust Model Based on Fuzzy Q-learning Algorithm in Mobile P2P Network
    Cao, Xiaomei
    Wang, Jian
    Zhu, Haitao
    2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 113 - 116
  • [34] Sequential Channel Selection for Decentralized Cognitive Radio Sensor Network Based On Modified Q-Learning Algorithm
    Zeng, Fanzi
    Liu, Hanshan
    Xu, Jisheng
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 657 - 662
  • [35] Multi-objective virtual network embedding algorithm based on Q-learning and curiosity-driven
    He, Mengyang
    Zhuang, Lei
    Tian, Shuaikui
    Wang, Guoqing
    Zhang, Kunli
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
  • [36] Dispatching Algorithm Design for Elevator Group Control System with Q-Learning based on a Recurrent Neural Network
    Liu, Weipeng
    Liu, Ning
    Sun, Hexu
    Xing, Guansheng
    Dong, Yan
    Chen, Haiyong
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3397 - 3402
  • [37] Multi-objective virtual network embedding algorithm based on Q-learning and curiosity-driven
    Mengyang He
    Lei Zhuang
    Shuaikui Tian
    Guoqing Wang
    Kunli Zhang
    EURASIP Journal on Wireless Communications and Networking, 2018
  • [38] Reinforcement Learning-based control using Q-learning and gravitational search algorithm with experimental validation on a nonlinear servo system
    Zamfirache, Iuliu Alexandru
    Precup, Radu-Emil
    Roman, Raul-Cristian
    Petriu, Emil M.
    INFORMATION SCIENCES, 2022, 583 : 99 - 120
  • [39] Real-Time Data Transmission Scheduling Algorithm for Wireless Sensor Networks Based on Deep Q-Learning
    Zhang, Aiqi
    Sun, Meiyi
    Wang, Jiaqi
    Li, Zhiyi
    Cheng, Yanbo
    Wang, Cheng
    ELECTRONICS, 2022, 11 (12)
  • [40] Resilient Many-to-Many Network Design of Fourth-Party Logistics based on Iterated Local Search Algorithm
    Li Rui
    Tong Yujun
    Sun Fuming
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 9605 - 9609