A Joint Network for Grasp Detection Conditioned on Natural Language Commands

Yiye Chen,Ruinian Xu,Yunzhi Lin,Patricio A. Vela,Yiye Chen,Ruinian Xu,Yunzhi Lin,Patricio A. Vela

We consider the task of grasping a target object based on a natural language command query. Previous work primarily focused on localizing the object given the query, which requires a separate grasp detection module to grasp it. The cascaded application of two pipelines incurs errors in overlapping multi-object cases due to ambiguity in the individal outputs. This work proposes a model named Comman...