Information-Theoretic Safe Exploration with Gaussian Processes

Bottero, Alessandro G.; Luis, Carlos E.; Vinogradska, Julia; Berkenkamp, Felix; Peters, Jan

Computer Science > Machine Learning

arXiv:2212.04914 (cs)

[Submitted on 9 Dec 2022]

Title:Information-Theoretic Safe Exploration with Gaussian Processes

Authors:Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

View PDF

Abstract:We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown constraint and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and cannot be directly extended to the continuous case. Moreover, the way in which they exploit regularity assumptions about the constraint introduces an additional critical hyperparameter. In this paper, we propose an information-theoretic safe exploration criterion that directly exploits the GP posterior to identify the most informative safe parameters to evaluate. Our approach is naturally applicable to continuous domains and does not require additional hyperparameters. We theoretically analyze the method and show that we do not violate the safety constraint with high probability and that we explore by learning about the constraint up to arbitrary precision. Empirical evaluations demonstrate improved data-efficiency and scalability.

Comments:	Submitted to NeurIPS 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2212.04914 [cs.LG]
	(or arXiv:2212.04914v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2212.04914

Submission history

From: Alessandro Bottero [view email]
[v1] Fri, 9 Dec 2022 15:23:58 UTC (6,944 KB)

Computer Science > Machine Learning

Title:Information-Theoretic Safe Exploration with Gaussian Processes

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Information-Theoretic Safe Exploration with Gaussian Processes

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators