Using Learning Curve Predictions to Learn from Incorrect Feedback

Taylor A. Kessler Faulkner, Andrea L. Thomaz

Robots can incorporate data from human teachers when learning new tasks. However, this data is often noisy, which can cause robots to learn slowly or not at all. One method for learning from human teachers is Human-in-the-loop Reinforcement Learning (HRL), which can combine information from both an environmental reward and external feedback from human teachers. However, many HRL methods assume...