Off-policy Imitation Learning from Visual Inputs
Zhihao Cheng,Li Shen,Dacheng Tao,Zhihao Cheng,Li Shen,Dacheng Tao
Recently, various successful applications utilizing expert states in imitation learning (IL) have been witnessed. However, IL from visual inputs (ILfVI), which has a greater promise to be widely applied by using online visual resources, suffers from low data-efficiency and poor performance resulted from on-policy learning and high-dimensional visual inputs. We propose OPIfVI (Off-Policy Imitation ...


