Cheatsheet of Latex Code for Reinforcement Learning Equations

rockingdingo 2024-08-25 23:05 #rl #reinforcement learning

Navigation

In this blog, we will summarize the latex code of most fundamental equations of reinforcement learning (RL). This blog will cover many topics, including Bellman Equation, Markov Decision Process(MDP), Partial Observable Markov Decision Process(POMDP), DQN, A3C, etc.

1. Reinforcement learning

1.1 Bellman Equation

1. Reinforcement learning

1.1 Bellman Equation

Equation

$v_{\pi}(s)=\sum_{a}\pi(a|s)\sum_{s^{'},r}p(s^{'},r|s,a)[r+\gamma v_{\pi}(s^{'})]$

Latex Code

v_{\pi}(s)=\sum_{a}\pi(a|s)\sum_{s^{'},r}p(s^{'},r|s,a)[r+\gamma v_{\pi}(s^{'})]

Explanation

$v_{\pi}(s)$ : Value at state s in policy \pi

$v_{\pi}(s^{'})$ : Value at state s^{'} in policy \pi

$\pi(a|s)$ : Probability of choosing action a given state s

$r$ : Reward at state s

$\gamma$ : Reward discount factor \gamma

You can check more detailed information of Bellman Equation in this tutorial Introduction to Reinforcement Learning for more details.

1.1 Bellman Equation

Equation

Latex Code

Explanation

Cheatsheet of Latex Code for Reinforcement Learning Equations

Navigation

1. Reinforcement learning

Comments

Write Your Comment

Related Contents