A sojourn-based approach to semi-Markov Reinforcement Learning

Ascione, Giacomo; Cuomo, Salvatore

Mathematics > Probability

arXiv:2201.06827 (math)

[Submitted on 18 Jan 2022 (v1), last revised 20 Apr 2022 (this version, v2)]

Title:A sojourn-based approach to semi-Markov Reinforcement Learning

Authors:Giacomo Ascione, Salvatore Cuomo

View PDF

Abstract:In this paper we introduce a new approach to discrete-time semi-Markov decision processes based on the sojourn time process. Different characterizations of discrete-time semi-Markov processes are exploited and decision processes are constructed by their means. With this new approach, the agent is allowed to consider different actions depending also on the sojourn time of the process in the current state. A numerical method based on $Q$-learning algorithms for finite horizon reinforcement learning and stochastic recursive relations is investigated. Finally, we consider two toy examples: one in which the reward depends on the sojourn-time, according to the gambler's fallacy; the other in which the environment is semi-Markov even if the reward function does not depend on the sojourn time. These are used to carry on some numerical evaluations on the previously presented $Q$-learning algorithm and on a different naive method based on deep reinforcement learning.

Comments:	31 pages, 27 figures
Subjects:	Probability (math.PR); Numerical Analysis (math.NA)
Cite as:	arXiv:2201.06827 [math.PR]
	(or arXiv:2201.06827v2 [math.PR] for this version)
	https://doi.org/10.48550/arXiv.2201.06827

Submission history

From: Giacomo Ascione [view email]
[v1] Tue, 18 Jan 2022 08:55:47 UTC (85 KB)
[v2] Wed, 20 Apr 2022 14:26:35 UTC (175 KB)

Mathematics > Probability

Title:A sojourn-based approach to semi-Markov Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Probability

Title:A sojourn-based approach to semi-Markov Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators