TemporalFusion: Temporal Motion Reasoning with Multi-Frame Fusion for 6D Object Pose Estimation

Fengjun Mu,Rui Huang,Ao Luo,Xin Li,Jing Qiu,Hong Cheng,Fengjun Mu,Rui Huang,Ao Luo,Xin Li,Jing Qiu,Hong Cheng

6D object pose estimation is an essential task in vision-based robotic grasping and manipulation. Prior works extract spatial features by fusing the RGB image and depth without considering the temporal motion information, limiting their performance in heavy occlusion robotic grasping scenarios. In this paper, we present an end-to-end model named TemporalFusion, which integrates the temporal motion...