Improving Unimodal Object Recognition with Multimodal Contrastive Learning

Johannes Meyer,Andreas Eitel,Thomas Brox,Wolfram Burgard,Johannes Meyer,Andreas Eitel,Thomas Brox,Wolfram Burgard

Robots perceive their environment using various sensor modalities, e.g., vision, depth, sound or touch. Each modality provides complementary information for perception. However, while it can be assumed that all modalities are available for training, when deploying the robot in real-world scenarios the sensor setup often varies. In order to gain flexibility with respect to the deployed sensor setup...