Self-Supervised Learning for Alignment of Objects and Sound
Xinzhu Liu, Xiaoyu Liu, Di Guo, Huaping Liu, Fuchun Sun, Haibo Min
Sound source separation has many useful applications in robotics, such as human-robot interaction and scene understanding. However, it remains a very challenging problem. In this paper, we use both the visual and audio information in videos to perform the sound source separation task. A self-supervised learning framework is proposed to implement the object detection and ...