CVQVAE (Conditional-Vector-Quantized-Variational-Autoencoder) for text-to-image synthesis.

PyTorch implementation of conditional-VQVAE2 for generating high-fidelity multi-object images based on text captions.

Original paper: Generating Diverse High-Fidelity Images with VQ-VAE-2 (Razavi, van den Oord, and Vinyals, 2019)

This implementation is optimized for the MS-COCO dataset (Captions 2014). It currently supports a hierarchical VQVAE and a PixelSNAIL prior.
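
As an illustration of the vector-quantization step that a VQVAE is built around, here is a minimal sketch, not the code in vqvae.py. The function name `quantize`, the tensor shapes, and the toy codebook are assumptions made for the example; the hierarchical model applies this kind of lookup at more than one resolution.

```python
import torch


def quantize(z, codebook):
    """Map each latent vector in z to its nearest codebook entry.

    z:        (N, D) float tensor of encoder outputs (hypothetical shape)
    codebook: (K, D) float tensor of embedding vectors
    Returns (z_q, indices): z_q has shape (N, D), indices has shape (N,).
    """
    # Pairwise Euclidean distances between latents and codebook entries: (N, K).
    distances = torch.cdist(z, codebook)
    # Index of the closest codebook entry for every latent vector.
    indices = distances.argmin(dim=1)
    z_q = codebook[indices]
    # Straight-through estimator: the forward pass returns the quantized
    # values, the backward pass copies gradients to z unchanged.
    z_q = z + (z_q - z).detach()
    return z_q, indices


# Toy data: a codebook with K=3 entries of dimension D=2.
codebook = torch.tensor([[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]])
z = torch.tensor([[0.9, 1.2], [-0.1, 0.2], [-0.8, 1.7]], requires_grad=True)
z_q, indices = quantize(z, codebook)
print(indices.tolist())
```

The integer `indices` are the discrete codes that an autoregressive prior such as PixelSNAIL is trained on, while `z_q` is what the decoder receives.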

The code was imported from an ipynb notebook.

Credits: the vqvae_prior.py code is adapted from kamenbliznashki.

Prerequisites

MS-COCO captions dataset (Captions 2014), downloaded locally

PyTorch >= 1.6

GPU environment: the PixelSNAIL prior (vqvae_prior.py) is expensive to train, especially on high-resolution images
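
The captions dataset listed above ships as JSON annotation files with an `images` list and an `annotations` list. The following is a minimal sketch of pairing each image file with its captions using only the standard library. The helper names `group_captions` and `captions_by_image` are hypothetical, and the repository's own data loading may differ.

```python
import json
from collections import defaultdict


def group_captions(data):
    """Return {file_name: [caption, ...]} from a parsed COCO captions dict."""
    # Each image record carries an integer id and a file name.
    file_names = {img["id"]: img["file_name"] for img in data["images"]}
    grouped = defaultdict(list)
    # Each annotation links one caption to one image through image_id.
    for ann in data["annotations"]:
        grouped[file_names[ann["image_id"]]].append(ann["caption"].strip())
    return dict(grouped)


def captions_by_image(annotation_file):
    """Load an annotation file from disk and group its captions."""
    with open(annotation_file, encoding="utf-8") as f:
        return group_captions(json.load(f))


# Tiny in-memory example in the same layout as the annotation file.
example = {
    "images": [
        {"id": 1, "file_name": "a.jpg"},
        {"id": 2, "file_name": "b.jpg"},
    ],
    "annotations": [
        {"id": 10, "image_id": 1, "caption": "A dog on a sofa. "},
        {"id": 11, "image_id": 2, "caption": "Two people riding bikes."},
        {"id": 12, "image_id": 1, "caption": "A sleeping dog."},
    ],
}
pairs = group_captions(example)
print(pairs)
```

With the real data, the call would look like `captions_by_image("annotations/captions_train2014.json")`, where the path is an assumption about where the download was unpacked.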

Usage

Train the VQVAE (vqvae.py)
Extract codes
Train the PixelSNAIL prior (vqvae_prior.py)
Sample
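
The four steps above can be sketched as a small driver script. The script names are those of the files in this repository, but the mapping of steps to scripts is an assumption, as is running each script without arguments; the scripts may require options that are not documented here. The sketch defaults to a dry run that only prints the commands.

```python
import subprocess
import sys

# Assumed mapping of usage steps to repository scripts, in execution order.
PIPELINE = [
    ("Train the VQVAE", "train_vqvae.py"),
    ("Extract codes", "extract_codebook.py"),
    ("Train the PixelSNAIL prior", "train_vqvae_prior.py"),
    ("Sample", "sample.py"),
]


def build_commands(python=sys.executable):
    """Return one argument list per step, with no extra options (assumption)."""
    return [[python, script] for _, script in PIPELINE]


def run_pipeline(dry_run=True):
    """Print each step; execute it as well when dry_run is False."""
    for (step, _), cmd in zip(PIPELINE, build_commands()):
        print(f"{step}: {' '.join(cmd)}")
        if not dry_run:
            # check=True stops the pipeline as soon as one step fails.
            subprocess.run(cmd, check=True)


run_pipeline(dry_run=True)
```

Each step consumes the output of the previous one, so stopping at the first failure avoids training the prior on missing or stale codes.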
