Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning
Arpan Kusari, Jonathan P. How
A common approach for defining a reward function for multi-objective reinforcement learning (MORL) problems is the weighted sum of the multiple objectives. The weights are then treated as design parameters dependent on the expertise (and preference) of the person performing the learning, with the typical result that a new solution is required for any change in these settings. This paper investigat...