From “Thumbs Up” to “10 out of 10”: Reconsidering Scalar Feedback in Interactive Reinforcement Learning

Hang Yu,Reuben M. Aronson,Katherine H. Allen,Elaine Schaertl Short,Hang Yu,Reuben M. Aronson,Katherine H. Allen,Elaine Schaertl Short

Learning from human feedback is an effective way to improve robotic learning in exploration-heavy tasks. Compared to the wide application of binary human feedback, scalar human feedback has been used less because it is believed to be noisy and unstable. In this paper, we compare scalar and binary feedback, and demonstrate that scalar feedback benefits learning when properly handled. We collected b...