COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems
Shuang Ma, Sai Vemprala, Wenshan Wang, Jayesh K. Gupta, Yale Song, Daniel McDuff, Ashish Kapoor
Learning representations that generalize across tasks and domains is challenging yet necessary for autonomous systems. Although task-driven approaches are appealing, designing models specific to each application can be difficult in the face of limited data, especially when dealing with highly variable multimodal input spaces arising from different tasks in different environments. We introduce the...