Real-Time Learning-Based Model Predictive Control: Online Algorithms and Applications in Energy Systems
OPEN ACCESS
Loading...
Author / Producer
Date
2021-03
Publication Type
Master Thesis
ETH Bibliography
yes
Citations
Altmetric
OPEN ACCESS
Data
Rights / License
Abstract
The increased availability of sensing and computational capabilities in modern cyber-physical systems and networked systems has led to a growing interest in learning and data-driven control techniques. Learning-Based Model Predictive Control (LBMPC), i.e. the integration of learning methods in Model Predictive Control schemes, is one technique with potential applications for the control of dynamical systems under uncertain and stochastic conditions including humans in the control loop. In real-time applications of LBMPC, control solutions must be achieved in limited time, given the computational burden of both the function-learning and control mechanisms. Furthermore, models and functions associated with users must be learned on the fly from possibly parsimonious feedback. In this work, a framework is proposed for a LBMPC scheme in which little or no prior information is known regarding the dynamic state functions of multiple devices and the cost functions modeling satisfaction, comfort or sense of safety of users interacting with these devices. Gaussian Processes are advocated as a non-parametric nonlinear function modeling technique to enable a low sampling rate for black-box dynamic state and dissatisfaction functions. A regularized primal-dual gradient-based optimization algorithm is adapted to the discrete-time case and integrated with the LBMPC framework to facilitate convergence in a real-time setting with a Q-linear tracking error. Considering the rapidly-evolving demands on the power grid as a result of increasing distributed energy resources, the proposed methodology is implemented for a typical Demand Response application, in which the power setpoints of a network of black-box thermostatically-controlled loads in a building are optimized to provide ancillary services to the grid under fluctuating power demands while minimizing costs incurred to the black-box users inhabiting the building. The methodology is also applied to the classic inverted pendulum problem for the purposes of comparing results across true and GP-learned dynamic state and cost functions.
Permanent link
Publication status
published
External links
Editor
Contributors
Examiner : Dall’Anese, Emiliano
Examiner : Dörfler, Florian
Book title
Journal / series
Volume
Pages / Article No.
Publisher
ETH Zurich
Event
Edition / version
Methods
Software
Geographic location
Date collected
Date created
Subject
Organisational unit
09478 - Dörfler, Florian / Dörfler, Florian