Design and tests of reinforcement-learning-based optimal power flow solution generator

Optimal power flow (OPF) is a very traditional problem in the research field of power systems. In this paper, an OPF solution generator based on reinforcement learning (RL) is proposed. The solution process of OPF is modeled as a one-step Markov Decision Process (MDP) and is solved using the Twin De...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Hongyue Zhen, Hefeng Zhai, Weizhe Ma, Ligang Zhao, Yixuan Weng, Yuan Xu, Jun Shi, Xiaofeng He
Formato: article
Lenguaje:EN
Publicado: Elsevier 2022
Materias:
TD3
Acceso en línea:https://doaj.org/article/c86980c64de74a089d63e76c876a750a
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:Optimal power flow (OPF) is a very traditional problem in the research field of power systems. In this paper, an OPF solution generator based on reinforcement learning (RL) is proposed. The solution process of OPF is modeled as a one-step Markov Decision Process (MDP) and is solved using the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm. A warm-up training mechanism is adopted to realize better initialization of neural networks. Parallel computing is utilized to expand the searching range and improve training efficiency. Numerical tests are carried out in the IEEE-39 system. The results prove the correctness and efficiency of the proposed algorithm. The actor (policy) network of the well-trained agent can serve as a fast optimal power flow solution generator and can be applied to online scenarios.