ViT
[Paper Review] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT)
·625 words·3 mins
CV
ViT
Transformer
Vision Transformer
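A review of the Vision Transformer (ViT), which applies a standard Transformer to sequences of image patches for image recognition and, when pre-trained on large datasets, matches or outperforms state-of-the-art CNNs while requiring less compute to train.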