Abstract: Remote sensing visual question answering (RSVQA) leverages vision and natural language for intelligent Earth observation. While multimodal large language models (LLMs) demonstrate broad ...