Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning

robot,IROS 2022

Linrui Zhang,Zichen Yan,Li Shen,Shoujie Li,Xueqian Wang,Dacheng Tao,Linrui Zhang,Zichen Yan,Li Shen,Shoujie Li,Xueqian Wang,Dacheng Tao

Learning a risk-aware policy is essential but rather challenging in unstructured robotic tasks. Safe reinforcement learning methods open up new possibilities to tackle this problem. However, the conservative policy updates make it intractable to achieve sufficient exploration and desirable performance in complex, sample-expensive environments. In this paper, we propose a dual-agent safe reinforcem...

Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning

Linrui Zhang,Zichen Yan,Li Shen,Shoujie Li,Xueqian Wang,Dacheng Tao,Linrui Zhang,Zichen Yan,Li Shen,Shoujie Li,Xueqian Wang,Dacheng Tao

Discussion

Related Contents