Disturbance-injected Robust Imitation Learning with Task Achievement
Hirotaka Tahara,Hikaru Sasaki,Hanbit Oh,Brendan Michael,Takamitsu Matsubara,Hirotaka Tahara,Hikaru Sasaki,Hanbit Oh,Brendan Michael,Takamitsu Matsubara
Robust imitation learning using disturbance injections overcomes issues of limited variation in demonstrations. However, these methods assume demonstrations are optimal, and that policy stabilization can be learned via simple augmentations. In real-world scenarios, demonstrations are often of diverse-quality, and disturbance injection instead learns sub-optimal policies that fail to replicate desi...