Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning

Zhang, Hao; Wang, Hao; Kan, Zhen

doi:10.1109/LRA.2023.3290511

arXiv:2209.13220 (cs)

[Submitted on 27 Sep 2022 (v1), last revised 17 Jul 2023 (this version, v2)]

Authors: Hao Zhang, Hao Wang, Zhen Kan

Abstract: Automaton-based approaches have enabled robots to perform various complex tasks. However, most existing automaton-based algorithms rely heavily on manually customized state representations for the task at hand, which limits their applicability in deep reinforcement learning. To address this issue, we incorporate the Transformer into reinforcement learning and develop a Double-Transformer-guided Temporal Logic framework (T2TL) that exploits the structural features of the Transformer twice: first, the LTL instruction is encoded by a Transformer module so that task instructions are understood efficiently during training; second, the context variable is encoded by a Transformer again to improve task performance. The LTL instruction is specified in co-safe LTL. LTL progression, a semantics-preserving rewriting operation, is used to decompose the complex task into learnable sub-goals, which not only converts the non-Markovian reward decision process into a Markovian one but also improves sampling efficiency by learning multiple sub-tasks simultaneously. An environment-agnostic LTL pre-training scheme is further incorporated to facilitate learning of the Transformer module, resulting in an improved representation of LTL. Simulation results demonstrate the effectiveness of the T2TL framework.
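
To make the idea of LTL progression concrete, the following is a minimal Python sketch of the standard progression rules on a small co-safe fragment. It is not the authors' implementation: the tuple encoding of formulas, the operator names ("eventually", "until", "next") and the helper functions `progress`, `run`, `simplify_and` and `simplify_or` are all assumptions made for illustration, and negation is assumed to apply only to atomic propositions.

```python
from typing import AbstractSet, Tuple, Union

# Hypothetical encoding: True/False, an atomic proposition (str),
# or a tuple whose first element names the operator.
Formula = Union[bool, str, Tuple]


def simplify_and(a: Formula, b: Formula) -> Formula:
    """Conjunction with constant folding."""
    if a is False or b is False:
        return False
    if a is True:
        return b
    if b is True:
        return a
    return a if a == b else ("and", a, b)


def simplify_or(a: Formula, b: Formula) -> Formula:
    """Disjunction with constant folding."""
    if a is True or b is True:
        return True
    if a is False:
        return b
    if b is False:
        return a
    return a if a == b else ("or", a, b)


def progress(formula: Formula, label: AbstractSet[str]) -> Formula:
    """Rewrite `formula` into what must still hold after observing `label`,
    the set of propositions that are true in the current step."""
    if isinstance(formula, bool):
        return formula
    if isinstance(formula, str):
        return formula in label
    op = formula[0]
    if op == "not":  # assumed to wrap an atomic proposition only
        return formula[1] not in label
    if op == "and":
        return simplify_and(progress(formula[1], label), progress(formula[2], label))
    if op == "or":
        return simplify_or(progress(formula[1], label), progress(formula[2], label))
    if op == "next":
        return formula[1]
    if op == "eventually":  # F f  ==  f or X(F f)
        return simplify_or(progress(formula[1], label), formula)
    if op == "until":  # a U b  ==  b or (a and X(a U b))
        keep_waiting = simplify_and(progress(formula[1], label), formula)
        return simplify_or(progress(formula[2], label), keep_waiting)
    raise ValueError(f"unknown operator: {op!r}")


def run(formula: Formula, trace) -> Formula:
    """Progress a formula through a sequence of labels."""
    for label in trace:
        formula = progress(formula, label)
    return formula


# "Eventually reach a, and after that eventually reach b."
task = ("eventually", ("and", "a", ("eventually", "b")))
remaining = run(task, [set(), {"a"}])  # the part of the task still open
done = run(task, [set(), {"a"}, {"b"}])  # progresses to True
```

Because the progressed formula records everything about the past that still matters, pairing it with the environment state gives a Markovian decision process in the sense described in the abstract, and each intermediate formula such as `remaining` can be read as a sub-goal.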

Comments:	IEEE Robotics and Automation Letters
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
Cite as:	arXiv:2209.13220 [cs.RO]
	(or arXiv:2209.13220v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2209.13220
Journal reference:	IEEE Robotics and Automation Letters, 2023
Related DOI:	https://doi.org/10.1109/LRA.2023.3290511

Submission history

From: Hao Zhang
[v1] Tue, 27 Sep 2022 07:41:11 UTC (4,669 KB)
[v2] Mon, 17 Jul 2023 06:08:33 UTC (5,516 KB)
