Local Differential Privacy for Regret Minimization in Reinforcement Learning

Garcelon, Evrard; Perchet, Vianney; Pike-Burke, Ciara; Pirotta, Matteo

Computer Science > Machine Learning

arXiv:2010.07778 (cs)

[Submitted on 15 Oct 2020 (v1), last revised 27 Oct 2021 (this version, v3)]

Abstract: Reinforcement learning algorithms are widely used in domains where it is desirable to provide a personalized service. In these domains, user data often contains sensitive information that must be protected from third parties. Motivated by this, we study privacy in finite-horizon Markov Decision Processes (MDPs) by requiring information to be obfuscated on the user side. We formulate this notion of privacy for RL using the local differential privacy (LDP) framework. We establish a lower bound for regret minimization in finite-horizon MDPs with LDP guarantees, which shows that guaranteeing privacy has a multiplicative effect on the regret. This result shows that, while LDP is an appealing notion of privacy, it makes the learning problem significantly more complex. Finally, we present an optimistic algorithm that simultaneously satisfies $\varepsilon$-LDP requirements and achieves $\sqrt{K}/\varepsilon$ regret in any finite-horizon MDP after $K$ episodes, matching the lower bound's dependence on the number of episodes $K$.
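
To make "obfuscated on the user side" concrete, here is a minimal sketch of one standard way to obtain an $\varepsilon$-LDP release of a single bounded value: the Laplace mechanism. The abstract does not say which mechanism the paper uses, so this is an illustration under stated assumptions, not the authors' algorithm. The function names (`laplace_noise`, `privatize_reward`, `mean_of_private_rewards`) and the choice to privatize a single reward in [0, 1] are hypothetical.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def privatize_reward(reward, epsilon, rng):
    """Release a reward in [0, 1] under epsilon-LDP (Laplace mechanism).

    Assumption: this runs on the user's device, so the learner only
    ever sees the noisy value.
    """
    if not 0.0 <= reward <= 1.0:
        raise ValueError("reward must lie in [0, 1]")
    if epsilon <= 0.0:
        raise ValueError("epsilon must be positive")
    # Any two rewards in [0, 1] differ by at most 1 (sensitivity 1),
    # so Laplace noise with scale 1 / epsilon gives epsilon-LDP.
    return reward + laplace_noise(1.0 / epsilon, rng)


def mean_of_private_rewards(rewards, epsilon, seed=0):
    # Learner-side estimate built only from privatized values.
    rng = random.Random(seed)
    noisy = [privatize_reward(r, epsilon, rng) for r in rewards]
    return sum(noisy) / len(noisy)
```

In this sketch each noisy value has standard deviation $\sqrt{2}/\varepsilon$, so the error of an average over $K$ privatized samples is of order $1/(\varepsilon\sqrt{K})$. That gives some intuition, though not a proof, for why $\varepsilon$ appears in the denominator of the regret rate stated in the abstract.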

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2010.07778 [cs.LG]
	(or arXiv:2010.07778v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.07778

Submission history

From: Evrard Garcelon
[v1] Thu, 15 Oct 2020 14:13:26 UTC (1,883 KB)
[v2] Tue, 16 Mar 2021 09:55:06 UTC (2,547 KB)
[v3] Wed, 27 Oct 2021 12:46:21 UTC (2,145 KB)
