LAMP: Leveraging Language Prompts for Multi-Person Pose Estimation
Shengnan Hu, Ce Zheng, Zixiang Zhou, Chen Chen, Gita Sukthankar
Human-centric visual understanding is an important desideratum for effective human-robot interaction. To navigate crowded public places, social robots must be able to interpret the activity of the surrounding humans. This paper addresses one key aspect of human-centric visual understanding: multi-person pose estimation. Achieving good performance on multi-person pose estimation in crowded...