Learning to Solve Tasks with Exploring Prior Behaviours

Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Celiktutan

Demonstrations are widely used in Deep Reinforcement Learning (DRL) to facilitate solving tasks with sparse rewards. However, tasks in real-world scenarios often have initial conditions that differ from those in the demonstration, which requires additional prior behaviours. For example, suppose we are given a demonstration for the task of picking up an object from an open drawer, but the draw...