Depth Is All You Need for Monocular 3D Detection

Dennis Park,Jie Li,Dian Chen,Vitor Guizilini,Adrien Gaidon,Dennis Park,Jie Li,Dian Chen,Vitor Guizilini,Adrien Gaidon

A key contributor to recent progress in 3D detection from single images is monocular depth estimation. Existing methods focus on how to leverage depth explicitly, by generating pseudo-pointclouds or providing attention cues for image features. More recent works leverage depth prediction as a pretraining task and fine-tune the depth representation while training it for 3D detection. However, the ad...