GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

Sukhija, Bhavya; Turchetta, Matteo; Lindner, David; Krause, Andreas; Trimpe, Sebastian; Baumann, Dominik

doi:10.1016/j.artint.2023.103922

arXiv:2201.09562 (cs)

[Submitted on 24 Jan 2022 (v1), last revised 12 Jun 2023 (this version, v5)]

Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be applied to most real-world dynamical systems. This work proposes GoSafeOpt as the first algorithm that can safely discover globally optimal policies for high-dimensional systems while giving safety and optimality guarantees. We demonstrate the superiority of GoSafeOpt over competing model-free safe learning methods on a robot arm, a system on which applying GoSafe would be prohibitive.

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2201.09562 [cs.LG]
	(or arXiv:2201.09562v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2201.09562
Journal reference:	Artificial Intelligence, Volume 320, Year 2023
Related DOI:	https://doi.org/10.1016/j.artint.2023.103922

Submission history

From: Bhavya Sukhija
[v1] Mon, 24 Jan 2022 10:05:44 UTC (22,535 KB)
[v2] Tue, 25 Jan 2022 06:47:44 UTC (22,535 KB)
[v3] Thu, 31 Mar 2022 14:53:39 UTC (22,237 KB)
[v4] Wed, 12 Apr 2023 21:56:33 UTC (6,447 KB)
[v5] Mon, 12 Jun 2023 12:20:59 UTC (6,447 KB)
