Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Rawlik, Konrad; Toussaint, Marc; Vijayakumar, Sethu

Computer Science > Machine Learning

arXiv:1208.2523 (cs)

[Submitted on 13 Aug 2012]

Title:Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Authors:Konrad Rawlik, Marc Toussaint, Sethu Vijayakumar

View PDF

Abstract:We present an embedding of stochastic optimal control problems, of the so called path integral form, into reproducing kernel Hilbert spaces. Using consistent, sample based estimates of the embedding leads to a model free, non-parametric approach for calculation of an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant and task dependent component. Consequently, we make much more efficient use of the sample data compared to previous sample based approaches in this domain, e.g., by allowing sample re-use across tasks. Numerical examples on test problems, which illustrate the sample efficiency, are provided.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1208.2523 [cs.LG]
	(or arXiv:1208.2523v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1208.2523

Submission history

From: Konrad Rawlik [view email]
[v1] Mon, 13 Aug 2012 08:30:14 UTC (437 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2012-08

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Konrad Rawlik
Marc Toussaint
Sethu Vijayakumar

export BibTeX citation

Computer Science > Machine Learning

Title:Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators