Active Reward Learning for Co-Robotic Vision Based Exploration in Bandwidth Limited Environments

Stewart Jamieson,Jonathan P. How,Yogesh Girdhar,Stewart Jamieson,Jonathan P. How,Yogesh Girdhar

We present a novel POMDP problem formulation for a robot that must autonomously decide where to go to collect new and scientifically relevant images given a limited ability to communicate with its human operator. From this formulation we derive constraints and design principles for the observation model, reward model, and communication strategy of such a robot, exploring techniques to deal with th...