Abstract: This paper explores zero-shot Vision-and-Language Navigation (VLN), enabling agents to generalize navigation to unseen data classes. Most current approaches rely on large models, but these ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results