Deep Tutorials for PyTorch
This is a series of in-depth tutorials I'm writing for implementing cool deep learning models on your own with the amazing PyTorch library.
image-captioning sequence-labeling object-detection text-classification
VirTex: Learning Visual Representations from Textual Annotations
We train a CNN+Transformer from scratch on COCO, transfer the CNN to 6 downstream vision tasks, and exceed ImageNet features despite using 10x fewer ...
convolutional-neural-networks transformers coco visual-representations
Hugging Captions
Generate realistic, Instagram-worthy captions using transformers, given a hashtag and a small text snippet.
text-generation transformers huggingface instagram
Video object grounding
Video object grounding using semantic roles in language description.
grounding visual-grounding video video-object-grounding
Lecture 10 | Recurrent Neural Networks
Discusses the use of recurrent neural networks for modeling sequence data.
recurrent-neural-networks gated-recurrent-units lstm language-modeling
ViLBERT-MT: Multi-Task Vision & Language Representation Learning
A single ViLBERT Multi-Task model can perform 8 different vision and language tasks learnt from 12 datasets!
visual-question-answering image-captioning multi-modal computer-vision
Show, Infer & Tell: Contextual Inference for Creative Captioning
The beauty of the work lies in the way it architects the fundamental idea that humans look at the overall image and then individual pieces of it.
image-captioning deep-learning recurrent-neural-networks attention