Abstract: Pretrained Vision-Language Models (VLMs) have served as excellent foundation models for transfer learning in diverse downstream tasks. However, tuning VLMs for few-shot generalization tasks ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results