Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives
Alper Kamil Bozkurt,Yu Wang,Michael M. Zavlanos,Miroslav Pajic,Alper Kamil Bozkurt,Yu Wang,Michael M. Zavlanos,Miroslav Pajic
We study synthesis of control strategies from linear temporal logic (LTL) objectives in unknown environments. We model this problem as a turn-based zero-sum stochastic game between the controller and the environment, where the transition probabilities and the model topology are fully unknown. The winning condition for the controller in this game is the satisfaction of the given LTL specification, ...