News

A doctored image has been repeatedly shared in social media posts claiming it shows US President Donald Trump reacting angrily to news of former South Korean leader Yoon Suk Yeol's detention. The ...
The algorithm aligns lecture videos with corresponding slides with a multimodal algorithm that uses audio, OCR and image features all together. The approach uses dynamic programming to include a ...
Recent text-to-image (T2I) generation models have advanced significantly, enabling the creation of high-fidelity images from textual prompts. However, existing evaluation benchmarks primarily focus on ...
Abstract: In the past few years, there has been significant progress in hyperspectral image classification (HSIC). However, when the trained classifier on the source scene is directly applied to a new ...
Abstract: Pre-trained vision-language models (VLMs) and language models (LMs) have recently garnered significant attention due to their remarkable ability to represent textual concepts, opening up new ...