The Show and Tell model is an example of an encoder-decoder neural network. It works by first "encoding" an image into a fixed-length vector representation, and then "decoding" the representation into a natural language description.

Don't forget to tag @tensorflow in your comment, otherwise they may not be notified.

Authors
Share this project
Similar projects
VirTex: Learning Visual Representations from Textual Annotations
We train CNN+Transformer from scratch from COCO, transfer the CNN to 6 downstream vision tasks, and exceed ImageNet features despite using 10x fewer ...
Deep Tutorials for PyTorch
This is a series of in-depth tutorials I'm writing for implementing cool deep learning models on your own with the amazing PyTorch library.
Lecture 10 | Recurrent Neural Networks
Discuss the use of recurrent neural networks for modeling sequence data.
Hugging Captions
Generate realistic instagram worthy captions using transformers given a hasthtag and a small text snippet.
Top collections