[Paper Review] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT)

21 April 2025·625 words·3 mins

CV ViT Transformer Vision Transformer

A review of Vision Transformer (ViT), which applies a standard Transformer to sequences of 16x16 image patches for image recognition and, when pre-trained on large datasets, performs strongly while requiring fewer computational resources to train.