Start State Selection for Control Policy Learning from Optimal Trajectories

Christoph Zelch, Jan Peters, Oskar von Stryk

Combining optimal control methods with machine learning approaches makes it possible to profit from the complementary benefits of each field in the control of robotic systems. Data from optimal trajectories provides valuable information that can be used to learn a near-optimal state-dependent feedback control policy. To obtain high-quality learning data, careful selection of optimal trajectories, determined by a...