Abstract: Visual grounding (VG) is a critical task that seeks to identify and localize a specific visual region within a given image based on a corresponding referring expression. Existing approaches ...
Abstract: Remote sensing visual grounding (RSVG) aims to accurately localize specific targets in remote sensing (RS) images based on natural language descriptions. However, existing RSVG datasets ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results