Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs
Jingyi Wang, Jinfa Huang, Can Zhang, Zhidong Deng
Dynamic scene graphs generated from video clips can enhance semantic visual understanding in a wide range of challenging tasks, such as environmental perception, autonomous navigation, and task planning for self-driving vehicles and mobile robots. During the temporal and spatial modeling involved in dynamic scene graph generation, it is particularly difficult to learn time-variant rel...


