Markov Decision Processes with Unknown State Feature Values for Safe Exploration using Gaussian Processes

Matthew Budd,Bruno Lacerda,Paul Duckworth,Andrew West,Barry Lennox,Nick Hawes,Matthew Budd,Bruno Lacerda,Paul Duckworth,Andrew West,Barry Lennox,Nick Hawes

When exploring an unknown environment, a mobile robot must decide where to observe next. It must do this whilst minimising the risk of failure, by only exploring areas that it expects to be safe. In this context, safety refers to the robot remaining in regions where critical environment features (e.g. terrain steepness, radiation levels) are within ranges the robot is able to tolerate. More specif...