Embodied Referring Expression for Manipulation Question Answering in Interactive Environment

Qie Sima,Sinan Tan,Huaping Liu,Fuchun Sun,Weifeng Xu,Ling Fu,Qie Sima,Sinan Tan,Huaping Liu,Fuchun Sun,Weifeng Xu,Ling Fu

Embodied agents are expected to perform more complicated tasks in an interactive environment, with the progress of Embodied AI in recent years. Existing embodied tasks including Embodied Referring Expression (ERE) and other QA-form tasks mainly focuses on interaction in term of linguistic instruction. Therefore, enabling the agent to manipulate objects in the environment for exploration actively h...