Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards

Community

Equation

Paper

Search

Agent

Doc

AI Store

Workspace

Register

Login

Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards

NIPS2020

Kyungjae Lee,Hongjun Yang,Sungbin Lim,Songhwai Oh

In this paper, we consider stochastic multi-armed bandits (MABs) with heavy-tailed rewards, whose p-th moment is bounded by a constant nu_p for 1

Discussion

Related Contents