Design and tests of reinforcement-learning-based optimal power flow solution generator

Optimal power flow (OPF) is a very traditional problem in the research field of power systems. In this paper, an OPF solution generator based on reinforcement learning (RL) is proposed. The solution process of OPF is modeled as a one-step Markov Decision Process (MDP) and is solved using the Twin De...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Hongyue Zhen, Hefeng Zhai, Weizhe Ma, Ligang Zhao, Yixuan Weng, Yuan Xu, Jun Shi, Xiaofeng He
Format: article
Langue:EN
Publié: Elsevier 2022
Sujets:
TD3
Accès en ligne:https://doaj.org/article/c86980c64de74a089d63e76c876a750a
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:Optimal power flow (OPF) is a very traditional problem in the research field of power systems. In this paper, an OPF solution generator based on reinforcement learning (RL) is proposed. The solution process of OPF is modeled as a one-step Markov Decision Process (MDP) and is solved using the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm. A warm-up training mechanism is adopted to realize better initialization of neural networks. Parallel computing is utilized to expand the searching range and improve training efficiency. Numerical tests are carried out in the IEEE-39 system. The results prove the correctness and efficiency of the proposed algorithm. The actor (policy) network of the well-trained agent can serve as a fast optimal power flow solution generator and can be applied to online scenarios.