Adversarial Latent Autoencoders
Introducing the Adversarial Latent Autoencoder (ALAE), a general architecture that can leverage recent improvements in GAN training procedures.
autoencoders generative-adversarial-networks latent-space disentanglement
STEFANN: Scene Text Editor using Font Adaptive Neural Network
A generalized method for realistic modification of textual content present in a scene image. ⭐️ Accepted in CVPR 2020.
scene-image scene-text-editor font-adaptive font-generation
GANSpace: Discovering Interpretable GAN Controls
This paper describes a simple technique to analyze Generative Adversarial Networks (GANs) and create interpretable controls for image synthesis.
generative-adversarial-networks image-generation interpretability interpretable-gans
GANs in Computer Vision Free Ebook / Article-series
This free ebook/article series covers 20 peer-reviewed, highly cited papers in chronological order, presented as a series of 6 articles.
generative-adversarial-networks image-generation computer-vision paper
TailorGAN: Making User-Defined Fashion Designs
Generate a photo-realistic image which combines the texture from reference A and the new attribute from reference B.
image-generation fashion design generative-adversarial-networks
In-Domain GAN Inversion for Real Image Editing
We propose an in-domain GAN inversion method, which not only faithfully reconstructs the input image but also ensures the inverted code to be semantically ...
generative-adversarial-networks inversion image-editing image-generation
3D Photography using Context-aware Layered Depth Inpainting
A multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view.
3d image-generation inpainting design
Multifactor Disentanglement and Encoding for Conditional Image Generation
image-generation disentanglement generative-models generative-adversarial-networks
Generative Modeling with Sparse Transformers
Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images, or sound.
transformers sparse-transformers image-generation music-generation
