Towards Open-World Interactive Disambiguation for Robotic Grasping
Yuchen Mo,Hanbo Zhang,Tao Kong,Yuchen Mo,Hanbo Zhang,Tao Kong
Language-based communications are essential in human-robot interaction, especially for the majority of non-expert users. In this paper, we present SeeAsk, an open-world interactive visual grounding system to grasp specified targets with ambiguous natural language instructions. The main contribution of SeeAsk is that it can robustly handle open-world scenes in terms of both open-set objects and ope...


