Motion2Vec: Semi-Supervised Representation Learning from Surgical Videos

Ajay Kumar Tanwani,Pierre Sermanet,Andy Yan,Raghav Anand,Mariano Phielipp,Ken Goldberg,Ajay Kumar Tanwani,Pierre Sermanet,Andy Yan,Raghav Anand,Mariano Phielipp,Ken Goldberg

Learning meaningful visual representations in an embedding space can facilitate generalization in downstream tasks such as action segmentation and imitation. In this paper, we learn a motion-centric representation of surgical video demonstrations by grouping them into action segments/subgoals/options in a semi-supervised manner. We present Motion2Vec, an algorithm that learns a deep embedding feat...