Preference Learning on the Execution of Collaborative Human-Robot Tasks

Thibaut Munzer¹, Marc Toussaint², Manuel Lopes³

¹INRIA
²University of Stuttgart
³Instituto Superior Tecnico

Details

11:30 - 11:35 | Tue 30 May | Room 4611/4612 | TUB6.1

Session: Learning and Adaptive Systems 2

Abstract

We present a novel method to learn human preferences during, and for, the execution of concurrent joint human-robot tasks. We consider tasks realized by a team of a human operator and a robot helper that should adapt to the humans task execution preferences. Different human operators can have different abilities, experiences, and personal preferences, so that a particular allocation of activities in the team is preferred over another. We cast the behavior of concurrent multi-agent cooperation as a semi markov decision process and show how to model and learn human preferences over the team behavior. After proposing two different interactive learning algorithms, we evaluate them and show that the system can effectively learn and adapt to human preferences