Guided by the Way: The Role of On-the-route Objects and Scene Text in Enhancing Outdoor Navigation
Yanjun Sun,Yue Qiu,Yoshimitsu Aoki,Hirokatsu Kataoka,Yanjun Sun,Yue Qiu,Yoshimitsu Aoki,Hirokatsu Kataoka
In outdoor environments, Vision-and-Language Navigation (VLN) requires an agent to rely on multi-modal cues from real-world urban environments and natural language instructions. While existing outdoor VLN models predict actions using a combination of panorama and instruction features, this approach ignores objects in the environment and learns data bias to fail navigation. According to our prelimi...