Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation

Jinyoung Choi,Christopher Dance,Jung-Eun Kim,Seulbin Hwang,Kyung-Sik Park,Jinyoung Choi,Christopher Dance,Jung-Eun Kim,Seulbin Hwang,Kyung-Sik Park

Modern navigation algorithms based on deep reinforcement learning (RL) show promising efficiency and robustness. However, most deep RL algorithms operate in a risk-neutral manner, making no special attempt to shield users from relatively rare but serious outcomes, even if such shielding might cause little loss of performance. Furthermore, such algorithms typically make no provisions to ensure safe...