共 25 条
Demand response-based autonomous voltage control for transmission and active distribution networks using modified partially observed Markov decision process model
被引:1
|作者:
Gu, Yaru
[1
]
Huang, Xueliang
[1
]
机构:
[1] Southeast Univ, Sch Elect Engn, Nanjing, Peoples R China
关键词:
artificial intelligence;
data mining;
decision making;
demand side management;
fault tolerance;
hidden Markov models;
voltage control;
FRAMEWORK;
D O I:
10.1049/gtd2.13027
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
To fully utilize the voltage regulation capacity and interaction characteristics of the Transmission and Distribution (T&D) system, a novel Modified Partially Observed Markov Decision Process (MPOMDP)-based Reinforcement Learning (RL) scheme for Autonomous Voltage Control is proposed, which incorporates Demand Response (DR) and cooperation with the Transmission Network . The proposed scheme consists of two vital components: an MPOMDP block and a Modified Asynchronous Advantage Actor-Critic-based RL block. The MPOMDP block innovatively exploits the confidence interval of the observed state to make a better perception of the precise system state by introducing two new probability vectors. Then the MPOMDP block is fed into the underlying architecture of the RL block for asynchronously capturing features and optimal decision-making, where the solving framework additionally brings in a public data buffer to realize boundary information sharing. Case studies are conducted on a modified T&D system considering N-1 contingencies, with a training dataset from a district in Suzhou, China. Simulation results demonstrate that the proposed scheme can achieve significant voltage optimization while ensuring fast convergence speed. A novel Modified Partially Observed Markov Decision Process (MPOMDP)-based Reinforcement Learning (RL) scheme for Autonomous Voltage Control is proposed, which incorporates Demand Response and cooperation with the Transmission Network. The proposed scheme consists of two vital components: an MPOMDP block which makes a better perception of the precise system state and a Modified Asynchronous Advantage Actor-Critic-based RL block which assists agents asynchronously to capture features and generate optimal decision-making.image
引用
收藏
页码:5155 / 5170
页数:16
相关论文