ANSEL Photobot: A Robot Event Photographer with Semantic Intelligence
Dmitriy Rivkin,Gregory Dudek,Nikhil Kakodkar,David Meger,Oliver Limoyo,Michael Jenkin,Xue Liu,Francois Hogan,Dmitriy Rivkin,Gregory Dudek,Nikhil Kakodkar,David Meger,Oliver Limoyo,Michael Jenkin,Xue Liu,Francois Hogan
Our work examines the way in which large language models can be used for robotic planning and sampling in the context of automated photographic documentation. Specifically, we illustrate how to produce a photo-taking robot with an exceptional level of semantic awareness by leveraging recent advances in general purpose language (LM) and vision-language (VLM) models. Given a high-level description o...


