Learning Stochastic Parametric Differentiable Predictive Control Policies

Drgoňa, Ján; Mukherjee, Sayak; Tuor, Aaron; Halappanavar, Mahantesh; Vrabie, Draguna

Computer Science > Machine Learning

arXiv:2203.01447 (cs)

[Submitted on 2 Mar 2022 (v1), last revised 22 May 2022 (this version, v2)]

Title:Learning Stochastic Parametric Differentiable Predictive Control Policies

Authors:Ján Drgoňa, Sayak Mukherjee, Aaron Tuor, Mahantesh Halappanavar, Draguna Vrabie

View PDF

Abstract:The problem of synthesizing stochastic explicit model predictive control policies is known to be quickly intractable even for systems of modest complexity when using classical control-theoretic methods. To address this challenge, we present a scalable alternative called stochastic parametric differentiable predictive control (SP-DPC) for unsupervised learning of neural control policies governing stochastic linear systems subject to nonlinear chance constraints. SP-DPC is formulated as a deterministic approximation to the stochastic parametric constrained optimal control problem. This formulation allows us to directly compute the policy gradients via automatic differentiation of the problem's value function, evaluated over sampled parameters and uncertainties. In particular, the computed expectation of the SP-DPC problem's value function is backpropagated through the closed-loop system rollouts parametrized by a known nominal system dynamics model and neural control policy which allows for direct model-based policy optimization. We provide theoretical probabilistic guarantees for policies learned via the SP-DPC method on closed-loop stability and chance constraints satisfaction. Furthermore, we demonstrate the computational efficiency and scalability of the proposed policy optimization algorithm in three numerical examples, including systems with a large number of states or subject to nonlinear constraints.

Comments:	Full version for the paper accepted at the 10th IFAC Symposium on Robust Control Design (ROCOND) 2022
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2203.01447 [cs.LG]
	(or arXiv:2203.01447v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.01447

Submission history

From: Sayak Mukherjee [view email]
[v1] Wed, 2 Mar 2022 22:46:32 UTC (405 KB)
[v2] Sun, 22 May 2022 00:55:30 UTC (1,102 KB)

Computer Science > Machine Learning

Title:Learning Stochastic Parametric Differentiable Predictive Control Policies

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Stochastic Parametric Differentiable Predictive Control Policies

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators