Surgical-VQLA:Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Long Bai,Mobarakol Islam,Lalithkumar Seenivasan,Hongliang Ren,Long Bai,Mobarakol Islam,Lalithkumar Seenivasan,Hongliang Ren
Despite the availability of computer-aided simulators and recorded videos of surgical procedures, junior residents still heavily rely on experts to answer their queries. However, expert surgeons are often overloaded with clinical and academic workloads and limit their time in answering. For this purpose, we develop a surgical question-answering system to facilitate robot-assisted surgical scene an...


