Local Policy Optimization for Trajectory-Centric Reinforcement Learning

Patrik Kolaric,Devesh K. Jha,Arvind U. Raghunathan,Frank L. Lewis,Mouhacine Benosman,Diego Romeres,Daniel Nikovski,Patrik Kolaric,Devesh K. Jha,Arvind U. Raghunathan,Frank L. Lewis,Mouhacine Benosman,Diego Romeres,Daniel Nikovski

The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu...