Perceive, Represent, Generate: Translating Multimodal Information to Robotic Motion Trajectories

Fábio Vital,Miguel Vasco,Alberto Sardinha,Francisco Melo,Fábio Vital,Miguel Vasco,Alberto Sardinha,Francisco Melo

We present Perceive-Represent-Generate (PRG), a novel three-stage framework that maps perceptual information of different modalities (e.g., visual or sound), corresponding to a series of instructions, to a sequence of movements to be executed by a robot. In the first stage, we perceive and preprocess the given inputs, isolating individual commands from the complete instruction provided by a human ...