Efficiently Learning Small Policies for Locomotion and Manipulation

robot,ICRA 2023

Shashank Hegde,Gaurav S. Sukhatme,Shashank Hegde,Gaurav S. Sukhatme

Neural control of memory-constrained, agile robots requires small, yet highly performant models. We leverage graph hyper networks to learn graph hyper policies trained with off-policy reinforcement learning resulting in networks that are two orders of magnitude smaller than commonly used networks yet encode policies comparable to those encoded by much larger networks trained on the same task. We s...

Efficiently Learning Small Policies for Locomotion and Manipulation

Shashank Hegde,Gaurav S. Sukhatme,Shashank Hegde,Gaurav S. Sukhatme

Discussion

Related Contents