Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

robot,ICRA 2020

Arpan Kusari,Jonathan P. How,Arpan Kusari,Jonathan P. How

A common approach for defining a reward function for multi-objective reinforcement learning (MORL) problems is the weighted sum of the multiple objectives. The weights are then treated as design parameters dependent on the expertise (and preference) of the person performing the learning, with the typical result that a new solution is required for any change in these settings. This paper investigat...

Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

Arpan Kusari,Jonathan P. How,Arpan Kusari,Jonathan P. How

Discussion

Related Contents