Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain,Varun Chhangani,Amogh Tiwari,K. Madhava Krishna,Vineet Gandhi,Kanishk Jain,Varun Chhangani,Amogh Tiwari,K. Madhava Krishna,Vineet Gandhi
We investigate the Vision-and-Language Navigation (VLN) problem in the context of autonomous driving in outdoor settings. We solve the problem by explicitly grounding the navigable regions corresponding to the textual command. At each timestamp, the model predicts a segmentation mask corresponding to the intermediate or the final navigable region. Our work contrasts with existing efforts in VLN, w...


