Adaptive Sampling using POMDPs with Domain-Specific Considerations

Gautam Salhotra,Christopher E. Denniston,David A. Caron,Gaurav S. Sukhatme,Gautam Salhotra,Christopher E. Denniston,David A. Caron,Gaurav S. Sukhatme

We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that r...