Optimizing Algorithms from Pairwise User Preferences

robot,IROS 2023

Leonid Keselman,Katherine Shih,Martial Hebert,Aaron Steinfeld,Leonid Keselman,Katherine Shih,Martial Hebert,Aaron Steinfeld

Typical black-box optimization approaches in robotics focus on learning from metric scores. However, that is not always possible, as not all developers have ground truth available. Learning appropriate robot behavior in human-centric contexts often requires querying users, who typically cannot provide precise metric scores. Existing approaches leverage human feedback in an attempt to model an impl...

Optimizing Algorithms from Pairwise User Preferences

Leonid Keselman,Katherine Shih,Martial Hebert,Aaron Steinfeld,Leonid Keselman,Katherine Shih,Martial Hebert,Aaron Steinfeld

Discussion

Related Contents