Accelerating Reinforcement Learning through Implicit Imitation

Boutilier, C.; Price, B.

doi:10.1613/jair.898

Computer Science > Machine Learning

arXiv:1106.0681 (cs)

[Submitted on 3 Jun 2011]

Title:Accelerating Reinforcement Learning through Implicit Imitation

Authors:C. Boutilier, B. Price

View PDF

Abstract:Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent's ability to learn useful behaviors by making intelligent use of the knowledge implicit in behaviors demonstrated by cooperative teachers or other more experienced agents. We propose and study a formal model of implicit imitation that can accelerate reinforcement learning dramatically in certain cases. Roughly, by observing a mentor, a reinforcement-learning agent can extract information about its own capabilities in, and the relative value of, unvisited parts of the state space. We study two specific instantiations of this model, one in which the learning agent and the mentor have identical abilities, and one designed to deal with agents and mentors with different action sets. We illustrate the benefits of implicit imitation by integrating it with prioritized sweeping, and demonstrating improved performance and convergence through observation of single and multiple mentors. Though we make some stringent assumptions regarding observability and possible interactions, we briefly comment on extensions of the model that relax these restricitions.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1106.0681 [cs.LG]
	(or arXiv:1106.0681v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1106.0681
Journal reference:	Journal Of Artificial Intelligence Research, Volume 19, pages 569-629, 2003
Related DOI:	https://doi.org/10.1613/jair.898

Submission history

From: C. Boutilier [view email] [via jair.org as proxy]
[v1] Fri, 3 Jun 2011 14:57:02 UTC (526 KB)

Computer Science > Machine Learning

Title:Accelerating Reinforcement Learning through Implicit Imitation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Accelerating Reinforcement Learning through Implicit Imitation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators