Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization
Homayoun Honari,Mehran Ghafarian Tamizi,Homayoun Najjaran,Homayoun Honari,Mehran Ghafarian Tamizi,Homayoun Najjaran
Safe reinforcement learning (Safe RL) refers to a class of techniques that aim to prevent RL algorithms from violating constraints in the process of decision-making and exploration during trial and error. In this paper, a novel model-free Safe RL algorithm, formulated based on the multi-objective policy optimization framework is introduced where the policy is optimized towards optimality and safet...