Work in Progress: Temporally Extended Auxiliary Tasks

Sherstan, Craig; Kartal, Bilal; Hernandez-Leal, Pablo; Taylor, Matthew E.

arXiv:2004.00600 (cs)

[Submitted on 1 Apr 2020 (v1), last revised 16 Apr 2020 (this version, v3)]

Abstract: Predictive auxiliary tasks have been shown to improve performance in numerous reinforcement learning works; however, this effect is still not well understood. The primary purpose of the work presented here is to investigate the impact that an auxiliary task's prediction timescale has on the agent's policy performance. We consider auxiliary tasks which learn to make on-policy predictions using temporal difference learning. We test the impact of prediction timescale using a specific form of auxiliary task in which the input image is used as the prediction target, which we refer to as temporal difference autoencoders (TD-AE). We empirically evaluate the effect of TD-AE on the A2C algorithm in the VizDoom environment using different prediction timescales. While we do not observe a clear relationship between the prediction timescale and performance, we make the following observations: 1) using auxiliary tasks allows us to reduce the trajectory length of the A2C algorithm; 2) in some cases, temporally extended TD-AE performs better than a straight autoencoder; 3) performance with auxiliary tasks is sensitive to the weight placed on the auxiliary loss; 4) despite this sensitivity, auxiliary tasks improved performance without extensive hyper-parameter tuning. Our overall conclusions are that TD-AE increases the robustness of the A2C algorithm to the trajectory length and that, while the approach is promising, further study is required to fully understand the relationship between auxiliary task prediction timescale and the agent's performance.
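As a rough illustration of the idea described in the abstract, the following is a minimal sketch of a one-step temporal difference target in which the observed image acts as the quantity being predicted. It is not taken from the paper: the function names (`td_ae_targets`, `td_ae_loss`), the discount parameter `gamma` as the knob for prediction timescale, the absence of any normalization, and the squared-error loss are all assumptions made for illustration. Under these assumptions, setting `gamma = 0` reduces the target to the current image, which corresponds to a plain autoencoder.

```python
import numpy as np


def td_ae_targets(images, next_predictions, gamma, dones):
    """Hypothetical one-step TD targets with the input image as the signal.

    images:           array (T, H, W), observed frames x_t
    next_predictions: array (T, H, W), the model's predictions at step t+1,
                      treated as constants (no gradient flows through them)
    gamma:            discount in [0, 1); assumed to set the prediction timescale
    dones:            array (T,), True where the episode ended after step t
    """
    images = np.asarray(images, dtype=np.float64)
    next_predictions = np.asarray(next_predictions, dtype=np.float64)
    not_done = 1.0 - np.asarray(dones, dtype=np.float64)
    # Reshape the termination mask so it broadcasts over the image dimensions.
    mask = not_done.reshape((-1,) + (1,) * (images.ndim - 1))
    # Bootstrapped target: current image plus discounted next prediction.
    return images + gamma * mask * next_predictions


def td_ae_loss(predictions, targets):
    """Mean squared error between predictions and (fixed) TD targets."""
    predictions = np.asarray(predictions, dtype=np.float64)
    return float(np.mean((predictions - targets) ** 2))


if __name__ == "__main__":
    frames = np.zeros((2, 4, 4))
    next_preds = np.ones((2, 4, 4))
    targets = td_ae_targets(frames, next_preds, gamma=0.9, dones=[False, True])
    print(targets[0, 0, 0], targets[1, 0, 0])
```

In an agent such as A2C, a loss of this kind would typically be added to the policy and value losses with a scalar weight, which is the weight the abstract reports performance to be sensitive to.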

Comments:	Accepted for the Adaptive and Learning Agents (ALA) Workshop at AAMAS 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2004.00600 [cs.LG]
	(or arXiv:2004.00600v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.00600

Submission history

From: Craig Sherstan
[v1] Wed, 1 Apr 2020 17:36:14 UTC (325 KB)
[v2] Mon, 6 Apr 2020 22:45:14 UTC (325 KB)
[v3] Thu, 16 Apr 2020 21:42:56 UTC (325 KB)
