Dynamic Decision Frequency with Continuous Options
Amirmohammad Karimi,Jun Jin,Jun Luo,A. Rupam Mahmood,Martin Jagersand,Samuele Tosatto,Amirmohammad Karimi,Jun Jin,Jun Luo,A. Rupam Mahmood,Martin Jagersand,Samuele Tosatto
In classic reinforcement learning algorithms, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter, as setting it too short may increase the problem's difficulty by requiring the agent to make numerous decisions to achieve its goal while setting it too long can result in the agent losing control over the system. However, physic...