Baseline Text2Interaction
Baseline Text2Interaction

Different users have varying preferences. This naturally applies to robotics as well! Adjusting robot behavior to human preferences can require intensive human feedback, which prevents a quick adaptation to new users and changing circumstances.

To integrate new user preferences in a zero-shot manner, we propose Text2Interaction. We use foundation models to generate task plans, motion preferences as Python code, and parameters of a safe controller that align with the user preferences. By maximizing the combined probability of task completion and user satisfaction, we can reliably find plans that fulfill both requirements. In our real-world user study, we found that 83% of users think that it integrates their preferences into the robot’s plan, and 94% prefer Text2Interaction over the baseline.

Publications: