Cross Domain Policy Transfer with Effect Cycle-Consistency

Ruiqi Zhu,Tianhong Dai,Oya Celiktutan,Ruiqi Zhu,Tianhong Dai,Oya Celiktutan

Training a robotic policy from scratch using deep reinforcement learning methods can be prohibitively expensive due to sample inefficiency. To address this challenge, transferring policies trained in the source domain to the target domain becomes an attractive paradigm. Previous research has typically focused on domains with similar state and action spaces but differing in other aspects. In this p...