Learning from Imperfect Demonstrations via Adversarial Confidence Transfer

Zhangjie Cao,Zihan Wang,Dorsa Sadigh,Zhangjie Cao,Zihan Wang,Dorsa Sadigh

Existing learning from demonstration algorithms usually assume access to expert demonstrations. However, this assumption is limiting in many real-world applications since the collected demonstrations may be suboptimal or even consist of failure cases. We therefore study the problem of learning from imperfect demonstrations by learning a confidence predictor. Specifically, we rely on demonstrations...