Image Captioning with Transformer: This project applies a Transformer-based model to the image captioning task. In this study project, most of the work is reimplemented, some are adapted with lots of ...
Alexey Dosovitskiy et al. introduce ViT, an image classification model based on a classical Transformer encoder that shows good performance [6]. Based on ViT, Wei Liu et al. present an image ...
CNN Encoder: EfficientNetB0 - The CNN encoder is applied to extract high-level features from images. Then a Transformer encoder-decoder model is used to generate word-by-word captions. The ...
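The word-by-word generation step described above can be illustrated with a minimal greedy decoding loop. This is only a sketch under stated assumptions, not the project's implementation: `greedy_caption`, `toy_decoder`, the `<start>`/`<end>` markers, and the two-element feature vector are all hypothetical names and values introduced for illustration. The toy scorer stands in for a trained Transformer decoder that would attend to features extracted by the CNN encoder.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical sequence markers; a real vocabulary would define its own.
START, END = "<start>", "<end>"


def greedy_caption(
    image_features: Sequence[float],
    next_word_scores: Callable[[Sequence[float], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> List[str]:
    """Build a caption one word at a time, always taking the top-scoring word."""
    words = [START]
    for _ in range(max_len):
        # Score every candidate next word given the image and the prefix so far.
        scores = next_word_scores(image_features, words)
        best = max(scores, key=scores.get)
        if best == END:
            break
        words.append(best)
    return words[1:]  # drop the start marker


def toy_decoder(image_features: Sequence[float], prefix: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in for a trained decoder: follows a fixed script."""
    script = ["a", "dog", "runs", END]
    position = len(prefix) - 1  # number of words generated so far
    target = script[min(position, len(script) - 1)]
    return {word: (1.0 if word == target else 0.0) for word in set(script)}


caption = greedy_caption([0.1, 0.2], toy_decoder)
print(" ".join(caption))
```

Greedy selection is the simplest decoding choice; beam search is a common alternative when caption quality matters more than speed.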
Image segmentation is crucial in many fields, but existing image segmentation models based on encoder-decoder networks are constrained by manual parameter tuning and the limited applicability of the ...