OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

Fuchioka, Yuni; Xie, Zhaoming; van de Panne, Michiel

Computer Science > Robotics

arXiv:2210.01247 (cs)

[Submitted on 3 Oct 2022 (v1), last revised 23 Mar 2023 (this version, v3)]

Title:OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

Authors:Yuni Fuchioka, Zhaoming Xie, Michiel van de Panne

View PDF

Abstract:Reinforcement Learning (RL) has seen many recent successes for quadruped robot control. The imitation of reference motions provides a simple and powerful prior for guiding solutions towards desired solutions without the need for meticulous reward design. While much work uses motion capture data or hand-crafted trajectories as the reference motion, relatively little work has explored the use of reference motions coming from model-based trajectory optimization. In this work, we investigate several design considerations that arise with such a framework, as demonstrated through four dynamic behaviours: trot, front hop, 180 backflip, and biped stepping. These are trained in simulation and transferred to a physical Solo 8 quadruped robot without further adaptation. In particular, we explore the space of feed-forward designs afforded by the trajectory optimizer to understand its impact on RL learning efficiency and sim-to-real transfer. These findings contribute to the long standing goal of producing robot controllers that combine the interpretability and precision of model-based optimization with the robustness that model-free RL-based controllers offer.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2210.01247 [cs.RO]
	(or arXiv:2210.01247v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2210.01247

Submission history

From: Yuni Fuchioka [view email]
[v1] Mon, 3 Oct 2022 21:58:25 UTC (3,509 KB)
[v2] Fri, 4 Nov 2022 20:36:42 UTC (3,510 KB)
[v3] Thu, 23 Mar 2023 21:50:08 UTC (3,510 KB)

Computer Science > Robotics

Title:OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators