T3VIP: Transformation-based 3D Video Prediction
Iman Nematollahi,Erick Rosete-Beas,Seyed Mahdi B. Azad,Raghu Rajan,Frank Hutter,Wolfram Burgard,Iman Nematollahi,Erick Rosete-Beas,Seyed Mahdi B. Azad,Raghu Rajan,Frank Hutter,Wolfram Burgard
For autonomous skill acquisition, robots have to learn about the physical rules governing the 3D world dynamics from their own past experience to predict and reason about plausible future outcomes. To this end, we propose a transformation-based 3D video prediction (T3VIP) approach that explicitly models the 3D motion by decomposing a scene into its object parts and predicting their corresponding r...