Robotic Indoor Scene Captioning from Streaming Video
Xinghang Li, Di Guo, Huaping Liu, Fuchun Sun
Robots are usually equipped with cameras to explore indoor scenes, and they are expected to describe those scenes in natural language. Although image and video captioning have achieved considerable success, especially on public datasets, captions generated from indoor scene video are still not sufficiently informative or coherent. In this paper, we propose the ...