Bayesian Disturbance Injection: Robust Imitation Learning of Flexible Policies

Hanbit Oh,Hikaru Sasaki,Brendan Michael,Takamitsu Matsubara,Hanbit Oh,Hikaru Sasaki,Brendan Michael,Takamitsu Matsubara

Scenarios requiring humans to choose from multiple seemingly optimal actions are commonplace, however standard imitation learning often fails to capture this behavior. Instead, an over-reliance on replicating expert actions induces inflexible and unstable policies, leading to poor generalizability in an application. To address the problem, this paper presents the first imitation learning framework...