latest | popular

Filter by
VirTex: Learning Visual Representations from Textual Annotations
We train CNN+Transformer from scratch from COCO, transfer the CNN to 6 downstream vision tasks, and exceed ImageNet features despite using 10x fewer ...
convolutional-neural-networks transformers coco visual-representations
projects 1 - 1 of 1
Topic experts
Share a project
Share something you or the community has made with ML.