Battery control with lookahead constraints in distribution grids using reinforcement learning


Loading...

Date

2022-10

Publication Type

Journal Article

ETH Bibliography

yes

Citations

Altmetric

Data

Abstract

In this paper, a computationally efficient real-time control of a battery with lookahead state-of-energy constraints in active distribution grids with distributed energy sources is presented. The goal is to follow a previously computed dispatch plan or to optimize a monetary cost from buying and selling power at the point of common coupling. However, the lookahead constraints render the battery decisions non-trivial. The current practice in literature to solve this problem is Model Predictive Control (MPC), which does not scale for large grids. Instead, here, we propose a reinforcement learning approach based on the Deep Deterministic Policy Gradient (DDPG) algorithm. To satisfy the lookahead battery constraints we adapt the experience replay technique used in DDPG. To guarantee the satisfaction of the hard grid constraints, we introduce a safety layer that performs constrained optimization. Our approach does not need forecasts contrary to MPC. We perform evaluations on a realistic grid and comparisons with Lyapunov optimization and MPC. We show that we can achieve costs close to MPC and Lyapunov, while reducing the computational time by multiple orders of magnitude.

Publication status

published

Editor

Book title

Volume

211

Pages / Article No.

108551

Publisher

Elsevier

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Active distribution grids; Battery energy storage; Lookahead constraints; Real-time control; Deep reinforcement learning

Organisational unit

09481 - Hug, Gabriela / Hug, Gabriela check_circle
00002 - ETH Zürich

Notes

Funding

Related publications and datasets