Learning Control Policies from Optimal Trajectories

Christoph Zelch,Jan Peters,Oskar von Stryk,Christoph Zelch,Jan Peters,Oskar von Stryk

The ability to optimally control robotic systems offers significant advantages for their performance. While time-dependent optimal trajectories can numerically be computed for high dimensional nonlinear system dynamic models, constraints and objectives, finding optimal feedback control policies for such systems is hard. This is unfortunate, as without a policy, the control of real-world systems re...