Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control
Battery energy storage systems (BESSs) are able to facilitate economical operation of the grid through demand response (DR), and are regarded as the most significant DR resource. Among them, distributed BESS integrating home photovoltaics (PV) have developed rapidly, and account for nearly 40% of ne...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
MDPI AG
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/445e25fbd8364979ad3639d27eb2c7de |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:445e25fbd8364979ad3639d27eb2c7de |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:445e25fbd8364979ad3639d27eb2c7de2021-11-25T17:28:26ZOperation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control10.3390/en142277491996-1073https://doaj.org/article/445e25fbd8364979ad3639d27eb2c7de2021-11-01T00:00:00Zhttps://www.mdpi.com/1996-1073/14/22/7749https://doaj.org/toc/1996-1073Battery energy storage systems (BESSs) are able to facilitate economical operation of the grid through demand response (DR), and are regarded as the most significant DR resource. Among them, distributed BESS integrating home photovoltaics (PV) have developed rapidly, and account for nearly 40% of newly installed capacity. However, the use scenarios and use efficiency of distributed BESS are far from sufficient to be able to utilize the potential loads and overcome uncertainties caused by disorderly operation. In this paper, the low-voltage transformer-powered area (LVTPA) is firstly defined, and then a DR grid edge controller was implemented based on deep reinforcement learning to maximize the total DR benefits and promote three-phase balance in the LVTPA. The proposed DR problem is formulated as a Markov decision process (MDP). In addition, the deep deterministic policy gradient (DDPG) algorithm is applied to train the controller in order to learn the optimal DR strategy. Additionally, a life cycle cost model of the BESS is established and implemented in the DR scheme to measure the income. The numerical results, compared to deep Q learning and model-based methods, demonstrate the effectiveness and validity of the proposed method.Wenying LiMing TangXinzhen ZhangDanhui GaoJian WangMDPI AGarticlegrid edge controldemand response (DR)deep reinforcement learning (DRL)multi-agent algorithmdistributed battery energy storage system (BESS)three-phase unbalanceTechnologyTENEnergies, Vol 14, Iss 7749, p 7749 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
grid edge control demand response (DR) deep reinforcement learning (DRL) multi-agent algorithm distributed battery energy storage system (BESS) three-phase unbalance Technology T |
spellingShingle |
grid edge control demand response (DR) deep reinforcement learning (DRL) multi-agent algorithm distributed battery energy storage system (BESS) three-phase unbalance Technology T Wenying Li Ming Tang Xinzhen Zhang Danhui Gao Jian Wang Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
description |
Battery energy storage systems (BESSs) are able to facilitate economical operation of the grid through demand response (DR), and are regarded as the most significant DR resource. Among them, distributed BESS integrating home photovoltaics (PV) have developed rapidly, and account for nearly 40% of newly installed capacity. However, the use scenarios and use efficiency of distributed BESS are far from sufficient to be able to utilize the potential loads and overcome uncertainties caused by disorderly operation. In this paper, the low-voltage transformer-powered area (LVTPA) is firstly defined, and then a DR grid edge controller was implemented based on deep reinforcement learning to maximize the total DR benefits and promote three-phase balance in the LVTPA. The proposed DR problem is formulated as a Markov decision process (MDP). In addition, the deep deterministic policy gradient (DDPG) algorithm is applied to train the controller in order to learn the optimal DR strategy. Additionally, a life cycle cost model of the BESS is established and implemented in the DR scheme to measure the income. The numerical results, compared to deep Q learning and model-based methods, demonstrate the effectiveness and validity of the proposed method. |
format |
article |
author |
Wenying Li Ming Tang Xinzhen Zhang Danhui Gao Jian Wang |
author_facet |
Wenying Li Ming Tang Xinzhen Zhang Danhui Gao Jian Wang |
author_sort |
Wenying Li |
title |
Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
title_short |
Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
title_full |
Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
title_fullStr |
Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
title_full_unstemmed |
Operation of Distributed Battery Considering Demand Response Using Deep Reinforcement Learning in Grid Edge Control |
title_sort |
operation of distributed battery considering demand response using deep reinforcement learning in grid edge control |
publisher |
MDPI AG |
publishDate |
2021 |
url |
https://doaj.org/article/445e25fbd8364979ad3639d27eb2c7de |
work_keys_str_mv |
AT wenyingli operationofdistributedbatteryconsideringdemandresponseusingdeepreinforcementlearningingridedgecontrol AT mingtang operationofdistributedbatteryconsideringdemandresponseusingdeepreinforcementlearningingridedgecontrol AT xinzhenzhang operationofdistributedbatteryconsideringdemandresponseusingdeepreinforcementlearningingridedgecontrol AT danhuigao operationofdistributedbatteryconsideringdemandresponseusingdeepreinforcementlearningingridedgecontrol AT jianwang operationofdistributedbatteryconsideringdemandresponseusingdeepreinforcementlearningingridedgecontrol |
_version_ |
1718412280985550848 |